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METHODS AND COMPOSITIONS FOR THE DIAGNOSIS AND 
TREATMENT OF BODY WEIGHT DISORDERS , INCLUDING OBESITY 

Priority of provisioned application nc . 60/092 , 63C filed 
5 on July 21, 1998 and of provisional application no. 
60/104,978 filed on October 20, 1996, each of which is 
incorporated herein by reference in its entirety, is claimed 
under 35 U.S.C. § 119(e)(1)*. 

10 1. INTRODUCTION 

The present invention relates to mammalian mahogany 
genes, including the human mahogany gene, which are novel 
genes involved in the control of mammalian body weight. The 
invention encompasses nucleotide sequences of the mahogany 
gene, host cell expression systems of the mahogany -gene, and 
hosts which have been transformed by these expression 
systems, including transgenic animals. The invention also 
encompasses novel mahogany gene products, including 'mahogany- 
proteins, polypeptides and peptides containing amino acid 
sequences mahogany proteins, fusion proteins of mahogany 

20 proteins polypeptides and peptides, and antibodies directed 
against such mahogany gene products. 

The present invention also relates to methods and 
compositions for the diagnosis and treatment of mammalian 
body weight disorders, including obesity, cachexia, and 

25 anorexia, and for the identification of subjects susceptible 
to such disorders. Further, the invention relates to methods 
of using the mahogany gene and gene products of the invention 
for the identification of compounds which modulate the 
expression of the mahogany gene and/or the activity of the 
mahogany gene product. Such compounds ran be useful as 
therapeutic agents in the treatment of mammalian body weight 
disorders, including obesity, cachexia, and anorexia. 
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2 . BACKGROUND OF THE INVENTION 

Obesity represents the most prevalent of body weight 
disorders, and it is the most important nutritional disorder 
in the western world, with estimates of its prevalence 
5 ranging from 30% to 50% within the middle-aged population. 
Other body weight disorders, such as anorexia nervosa and 
bulimia nervosa, which together affect approximately 0.2% of 
the female population of the western world, also pose serious 
health threats. Further, such disorders as anorexia and 

10 cachexia < wastin 9) are also prominent features of other 
diseases such as cancer, cystic fibrosis, and AIDS. 

Obesity, defined as an excess of body fat relative to 
lean body mass, also contributes to other diseases. For 
example, this disorder is responsible for increased incidence 
of diseases such as coronary artery disease, hypertension, 

15 stroke, diabetes, hyper lipidemia , and some cancers <See, 

e.g., Nishina, P.M. etal., 1994, Metab. 43: 554-558; {Srundy, 
S.M. Sc Barnett, J. P., 1990, Dis. Mon. 36: 641-731). Obesity 
is not merely a behavioral problem, i.e., the result of 
voluntary hyperphagia. Rather, the differential body 

2 0 composition observed between obese and normal subjects 

results from differences in both metabolism and 
neurologic/metabolic interactions. These differences seem to 
be, to some extent, due to differences in gene expression, 
and/or level of gene products or activity (Friedman, J.M. et 
25 al . , 1991, Mammalian Gene 1: 130-144). 

The epidemiology of obesity strongly shows that the 
disorder exhibits inherited characteristics (Stunkard, 1990, 
N. Eng. J. Med. 322: 1438). Moll et al . have reported that, 
in many populations, obesity seems to be controlled by a few 
genetic loci (Moll et al . , 1991, Am. J. Hum. Gen. 49: 1243). 

3 0 

In addition, human twin studies strongly suggest a 
substantial genetic basis in the control of body weight, with 
estimates of heritability of 80-90% (Simopoulos , A. P. & 

- 2 - 
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Childs, E., eds., 1989, in "Genetic Variation and Nutrition 
in Obesaty" , World Review of Nutrition and Diabetes €3, S. 
Karger, Basel, Switzerland; Borjeson, M . / 1976, Acta . 
Paediatr. Scand. 65: 279-287). 
5 In other studies, non-obese persons who deliberately 

attempted to gain weight by systematically over-eating were 
found to be more resistant to such weight gain and able to 
maintain an elevated weight only by very high caloric intake. 
In contrast, spontaneously obese individuals are able to 
maintain their status with normal or only moderately elevated 

10 

caloric intake. In addition, it is a commonplace experience 
in animal husbandry that different strains of swine, cattle, 
etc., have different predispositions to obesity. Studies of 
the genetics of human obesity, and of animal models of 
obesity demonstrate that obesity results from complex 
15 defective regulation of both food intake, food induced energy 
expenditure, and of the balance between lipid and lean body 
anabolism. 

There are a number of genetic diseases in man and other 
species which feature obesity among their more prominent 
2Q symptoms, along with, frequently, dysmorphic features and 

mental retardation. For example, Prader-Willi syndrome (PWS; 
reviewed in Knoll, J.H. et al . , 1993, Am. J. Med. Genet. 46: 
2-6) affects approximately 1 in 20,000 live births, and 
involves poor neonatal muscle tone, facial and genital 
deformities, and generally obesity. 

25 

In addition to PWS, many other pleiotropic syndromes 
have been characterized which include obesity as a -symptom. 
These syndromes are genetically straightforward, and appear 
to involve autosomal recessive alleles. -Such diseases 
include, among others, Ahlstroem, -Carpenter, -Bardet-Biedl , 
30 Cohen, and Morgagni -Stewart -Monel Syndromes. 

A number of models exists for the study of obesity (see, 
e.g., Bray, G. A., 1992, Prog. Brain Res. 93: 333-341; and 

- 3 - 
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Bray, G.A., 1989, Amer. J. Clin.Nutr. 5: 891-902). For 
example, animals having mutations which lead to syndromes 
that include obesity symptoms have also been identified. 
Attempts have been made to utilize such animals as models for 
5 the study of obesity, and the best studied animal models to 
date for genetic obesity are mice. For reviews, see, e.g., 
Friedman, J.M. et al . , 1991, Mamm. Gen. 1; 130-144 ; Friedman, 
J.M. and Liebel, R.L., 1992, Ceil 69: 217-220. 

Studies utilizing mice have confirmed that obesity is a 
10 Very com P lex trait with a high degree of heri tability . 

Mutations at a number of loci have been identified which lead 
to obese phenotypes . These include the autosomal recessive 
mutations obese (oJb) , diabetes (db) , fat (fat) , and tubby 
( tuJb) . 

The dominant Yellow mutation (Ay) at the agouti locus 
15 causes a pleiotropic syndrome which causes moderate adult 
onset obesity, a yellow coat color, and a high incidence of 
tumor formation (Herberg, L. and Coleman, D.L., 1977, 
Metabolism 26:59), and an abnormal anatomic distribution of 
body fat (Coleman, D.L., 1978, Diabetologia 14:141-148). The 

20 mutation causes the widespread expression of a protein which 
is normally seen only in neonatal skin (Michaud, E. J. et 
al., 1994, Genes Devel . 8:1463-1472). The agouti protein has 
been reported to be a competitive antagonist of a-MSH binding 
to the melanocortin receptors MCI -R and MC4-R in vitro (Lu et 

25 al., 1996, Nature 3 71:799-802), and the authors speculated 

that de-regulated ubiquitous expression of agouti may lead to 
obesity by antagonism of. melanocortin receptors expressed 
outside the hair follicles. 

Mahogany (mg) and mahoganoid {md) are mutations that 

^ suppress the phenotypic effects of agouti protein in vivo 

(Lane and Green, 1960, J. Hered. 51: 228-230). The mahogany 
and mahoganoid mutation have been mapped to mouse chromosomes 
2 and 16, respectively «5reen, 1989, "Catalog of mutant genes 

'-. 4 - 
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and polymorphic loci", pp. 12-403 in Cenetic Variants and 
Straine of the Laboratory Mouse , Lyon, M. F. and Searie , 
A.G., eds Oxford University Press, Oxford). Mutations of 
both mg and md have been shown to suppress the effects of 
c agouti cn obesity as well as on coat color (Miller et al . , 
19S7, Genetics 146: 1407-1415). 

In summary, therefore, obesity, which poses a major, 
worldwide health problem, represents a complex, highly 
heritable trait. Given the severity, prevalence, and 
potential heterogeneity of such disorders, there exists a 
great need for the identification of those genes that 
participate in the control of body weight. 

3 _ SUMMARY OF THE INVENTION 

The present invention relates to the identification of 

15 novel nucleic acid molecules and proteins encoded by such 
nucleic acid molecules that are involved in the control of 
mammalian body weight, and which, further, are associated 
with mammalian body weight disorders such as obesity, 
cachexia, and anorexia. The nucleic acid molecules of the 

2 0 present invention represent the genes corresponding to the 
mammalian mahogany gene, including the human mahogany -gene. 

In particular, the compositions of the present invention 
include nucleic acid molecules which comprise the following 
sequences: (a) nucleotide sequences of the mahogany gene, 
including, e.g. . murine mahogany sequences as shown in FK3S . 

25 2A, 3B-D, 6A-B, 8A, and 9A, as well as allelic variants and 
homologs thereof, and human mahogany sequences, as shown, 
e.g. . in FIGS. 10A, 18A, 19A and 20A, as well as all-elic 
variants and homologs thereof; <b) nucleotide sequences that 
encode the mahogany gene product amino acid -sequences, as 

30 shown, e.g. .in in FIGS. 2B, 8B, 9B, 10B, 17, 18B, 19B and 
2 0B; (c) nucleotide sequences that encode portions of the 
mahogany gene product corresponding to its functional -domains 

- "5 - 
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and individual exons ; fd) nucleotide sequences comprising the 

nove _i mahooanv aene seouence? n *i ^^"i ^co^. ^ ^ ^ ----- . ^a*- 

mutants of the mahogany- gene product in which all or a Dart 
of one or more of the domains is deleted or altered, as 
5 shown, e.g. , m FIG. 6; (e) nucleotide sequences that encode 
fusion proteins comprising the mahogany gene procuc: , or one 
or more of its domains fused to a heterologous polypeptide; 
(f) nucleotide sequences within the mahogany gene, as well as 
chromosome sequences flanking the mahogany gene, see, e..c . , 
FIG. 1, which can be utilized as part of the methods of the 
present invention for the diagnosis of mammalian oody weight 
disorders, including obesity, cachexia, and anorexia, which 
are mediated by the mahogany gene, as well as for the 
identification of subjects susceptible to such disorders; (g) 
nucleic acid sequences that hybridize to the above described 
15 sequences under stringent or moderately stringent conditions, 
particularly human mg homologs. The nucleic acid molecules 
of the invention include, but are not limited to, cDNA and 
genomic DNA sequences of the mahogany gene. 

The present invention also encompasses expression 
20 products of the nucleic acid molecules listed above; i.e., 
proteins and/or polypeptides that are encoded by the above 
mahogany nucleic acid molecules. 

Agonists and antagonists of the mahogany gene and/or 
gene product are also included in the present invention. 
Such agonists and antagonists will include, for example, 
small molecules, large molecules, and antibodies directed 
against the mahogany gene product. Agonists and antagonists 
of the invention also include nucleotide sequences, such as 
ant i sense and ribozyme molecules, and gene or regulatory 
sequence replacement constructs, that can be used to inhibit 
30 or enhance expression of the mahogany gene . 

The present invention further encompasses cloning 
ctors, including expression vectors, that -contain the 

- *6 - 



vet 



WO 00/05373 PCT/US99/1h484 



nucleac acid moiecuies of the invention and can be used tc 
express those nucleic acid molecules ir. host organisms. The 
present invention also relates to host ceils engineered tc 
contain and/or express the nucleic acid moiecuies of the 
5 invention. Further, host organisms which have been 
transformed with these nucleic acid moiecuies are also 
encompassed in the present invention. Host organisms of the 
invention include organisms transformed with the cloning 
vectors described above, e.g., transgenic animals, 
particularly non-human transgenic animals, and particularly 

10 ' - i 

transgenic non- Human mammals. 

The transgenic animals of the invention include animals 
that express a mutant variant or polymorphism of a mahogany 
gene, particularly a mutant variant or polymorphism of a 
mahogany gene that is associated with a weight disorder such 
15 as obesity, cachexia, or anorexia. The transgenic animals of 
the invention further include those that express a mahogany 
transgene at higher or lower levels than normal. The 
transgenic animals of the invention further include those 
which express the mahogany gene in ell their cells, "mosaic" 
20 animals which express the mahogany gene in only some of their 
cells, and those in which the mahogany gene is selectively 
introduced into and expressed in a specific cell type(s). 
The transgenic animals of the invention also include "knock- 
out" animals. Knock-out animals comprise animals which have 
been engineered to no longer express the mahogany gene. 
25 The present invention also relates to methods and 

compositions for the diagnosis of mammalian body weight 
disorders, including obesity, cachexia, and anorexia, as well 
as for the identification of subjects susceptible to such 
disorders. Such methods comprise, for example, measuring 
30 expression of the mahogany gene in a patient sample, or 

detecting a mutation in the mahogany gene in the genome of a 
mammal, including a human, suspected of exhibiting such a 

- 7 - 
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weight disorder. The nucleic acid molecules of the invention 
can also be used as diagnostic hybridization probes, or as 
primers for diagnostic PCR analysis to identify of mahogany 
gene mutations, allelic variations, or regulatory defects, 
5 such as defects in the expression of the mahogany gene. Such 
diagnostic PCR analyses can be used to diagnose individuals 
with a body weight disorder associated with a particular 
mahogany gene mutation, allelic variation, or regulatory 
defect. Such diagnostic PCR analyses can also be used to 
identify individuals susceptible to such body weiqht 

10 

disorders and hyperphagia. 

Methods and compositions, including pharmaceutical 
compositions, for the treatment of body weight disorders such 
as obesity, cachexia, and anorexia are also included in the 
invention. Such methods and compositions are capable of 
15 modulating the level of mahogany gene expression and/or the 
level of activity of the mahogany gene product. Such methods 
include, for example, modulating the expression of the 
mahogany gene and/or the activity of the mahogany gene 
product for the treatment of a body weight disorder which is 
20 mediated by some other gene, for example by the agouti gene. 
The invention still further relates to methods for 
identifying compounds which modulate the expression of the 
mammalian mahogany gene and/or the synthesis or activity of 
mammalian mahogany gene products. Such compounds include 
^ therapeutic compounds which can be used as pharmaceutical 

compositions to reduce or eliminate the symptoms of mammalian 
body weight disorders such as obesity, cachexia, and 
anorexia. Cellular and non-cellular assays are described 
that can be used to identify compounds that interact with the 
mahogany gene and/or gene product, e.g., modulate the 
30 activity of the mahogany gene and/or bind to the mahogany 
gene product. Such cell -based assays of the invention 
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utilize cells, cell lines, or engineered cells -or cell lines 
that express the mahogany gene product. 

In one embodiment, such methods comprise contacting a 
compound to a cell that expresses a mahogany -gene, measuring 
5 the ievei of mahogany gene expression, gene product 

expression, or gene product activity, and comparing this 
level to the" level of mahogany gene expression, g-ene product 
expression, or gene product activity produced by the cell in 
the absence of the compound, such that if the level obtained 
in the presence of the compound differs from that obtained in 

10 its absence, a compound that modulates the -expression of the 
mammalian mahogany gene and/or the synthesis or activity of 
mammalian mahogany gene products has been identified. 

In an alternative embodiment, such methods comprise 
administering a compound to a host, e.g., a transgenic -animal 

15 that expresses a mahogany transgene or a mutant mahogany 
transgene, and measuring the level of mahogany gene 
expression, gene product expression, or gene product 
activity. The measured level is compared to the level of 
mahogany gene expression, gene product expression, or gene 

20 product activity in a host that is not exposed to the 

compound, such that if the level obtained when the host i-s 
exposed to the compound differs from that obtained when the 
host is not exposed to the compound, a compound -that 
modulates the expression of the mammalian mahogany gene 
and/or the synthesis or activity of mammalian mahogany gene 

25 

products, and/or the symptoms of a mammalian -body weight 
disorder, such as obesity, cachexia, or anorexia, has been 
identified . 

The Example presented in Section 6, below, describes the 
genetic and physical mapping of the mahogany gene to a 
30 specific 700 kb interval of mouse chromosome 2. The example 
presented in Section 7, below, describes -the identification 
of a transcription unit within this chromosome interval, 

- 9 - 
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referred to herein as the MG gene, which represents the 
mahogany gene. The expression and sequence- analysis of this 
candidate mahogany gene is described in the example presented 
in Section 8, below. These experiments prove that the 
5 candidate gene MG is indeed the mahogany gene. The example 
presented in Section 9, below, presents data demonstrating 
that the mechanism of mahogany action is specific for diet- 
induced obestity, therefore supporting the use of mahogany 
antagonists as a specific therapeutic for treatment of diet- 
induced body weight disorders. The example presented in 

10 

Section 10, below, presents the identification and 
characterization of the human mg gene, variants thereof and 
polypeptides encoded by the human mahogany sequences. 

DEFINITIONS 

15 As used herein, the following terms shall have the 

abbreviations indicated. 

BAC, bacterial artificial chromosomes 

bp, base pair(s) 

EST, expressed sequence tag 
2 0 m 9 * mahogany gene 

RFLP, restriction fragment length polymorphism 

RT-PCR, reverse transcriptase PCR 

SSCP, single -stranded conformational polymorphism 
SSLP, simple sequence length polymorphisms 
25 STS, short tag sequence 

YAC, yeast artificial chromosome 

4. BRIEF DESCRIPTION OF THE FIGURES 

f IG : — 1 • Physical map of the mahogany interval of mouse 
chromosome 2 . 



30 



FIG - 2 » Panel A(l)-A(3): cDNA nucleotide sequence of 
the wild-type {C57BL/6J) murine mahogany gene (SEQ ID NO: 1) , 
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including the 5' and 3' untranslated regions, and Panel B: 
the derived amino acid sequence (SEQ ID NO: 2) of the 
mahogany gene product . 

FIG. 3. Genomic structure and nucleotide sequences 
5 ' 

derived from the wild- type (C57BL/6J) mouse genomic regions 

containing the mg gene. Panel A, genomic structure; Panel 

B(l)-B(9), genomic sequence c56 (SEQ ID NO: 3); Panel C(l)- 

C(4), genomic sequence c96 (SEQ ID NO: 4); Panel D(l)-D<37), 

aenomic seouence of cllO/111 (SEQ ID NO: 5) . 
10 " 

FIG. 4. Structural depiction of MG cDNA without 
introns. CUB=CUB domain, metal=metallothionin domain; T- 
t ransmembrane domain. 

15 FIG. 5 (1) -5 (4) . Nucleotide sequence of primers used 

to amplify each of the exons in the mg gene. 

FIG, 6. Nucleotide sequence of the wild -type <SEQ ID 
NO: 6) and mahogany mutant (SEQ ID NO: 7) sequences in exon 
20 15 of the MG gene. Bases shown in bold are deleted in Mg3J 
mutant mg. 

FIG. 7. Differential 5 1 start sequences in the murine 
mahogany gene showing splice forms akml*0O3 and akmlO04 . 

25 

FIG. 8. Panel A, cDNA sequence (SEQ ID NO: 8) from one 
form of the differential 5' start site found in the murine 
(akml003) , Panel B, amino acid sequence (SEQ ID NO: 9) 
encoded by the cDNA of Panel A; Panel C, hydropathy plot of 
the akml003 amino acid sequence. 

30 

FIG. 9. Panel A, cDNA sequence (SEQ ID NO: lO) from 
one form of the differential 5' start site found in the 

- 11 - 
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murine 



•:aKml004); Panel B, amino acid sequence (SEQ ID 
NO: 22} encoded by the cDNA of Panel A; Panel C, hydropathy 
piot or tne aKml004 amino acid sequence. 



"IC-. 10 



Nucleotide sequence (SEQ ID NO: 12) of a 



cent ie containing, a portion of the human MG cDNA, panel A<1) 
AU) anc the translated amino acid sequence ( SEQ ID NO: -13), 
panel E. 



10 FIG " "• ■ Effect of ™9 on MC4r -/- induced weight gain 

an females (FIG. li A) and males (FIG. lie) , values depicted 
are the mean + /- SD within a designated time interval. 

FIG - 12 ' Ef f6Ct of m 3 °n monogenic obese mutants Lepr^ 
^ (FIG. 12A), tub (FIG. 12B), Cpe- (FIG . 12C) , and on high fat 
daet induced obesity (fig. 12D) ; the values indicated are 
the mean +/- SD of the weight length ratio for each animal 



FIG - 13 ' Genetic and Physical map of the region 
20 surrounding the mg locus; all MIT markers are presented with 
shortened names, e.g., D2MIT77 is indicated as D2M77 • 
locations of loci which also mapped on the human cytogenetic 
map are indicated in parentheses after the gene symbol . 

■ FIG - 13A - The 9«netic map of the mg gene region on 
25 the Millennium BSB mapping panel (Misumi, D.J. e t al . , 

1997, Science 275:135-138); 

FIG - 13B> The Senetic map obtained from crosses 
segregating mg mutant alleles; 

FIG - 13C ' The - 1 Mb BAC contig across the mg gene 
3 0 region of mouse Chromosome 2; 

FIG ' 13D - - T he transcriptional units identified in 
the mg region; the filled box indicates the mg gene. 
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whereas the hat cned box is a member cf the H:ch Mobility 
Group (HMG: gene family which sits betweer. coding exons 
21 and 22 c: the /no gene. 

FIG . 14 . Northern blot analysis with CSK/HeJ (lane 2: 
and three mutant alleles of mg : C3HeE/FeJ -/ncr 3 *-" (Lane 2), 
LDJ/Le-mc (Lane 3), and C3K/HeJ-/ngr" (Lane 4) ; the size marker 
is shown on the left, and hybridization with actin is shown 
below for loading comparisons. 

FIG . 15 . In situ hybridization data: FIG. ISA 
demonstrates widespread expression of mg throughout the mouse 
brain is seen in an antisense autoradiographic image of a 
C3H/HeJ brain at the level of the 3rd ventricle; decreased 
expression m mg mutants is documented in selected antisense 
darkfieid images of 10 fim whole mount cross sections of the 
ventromedial hypothalamic nucleic (VMH) cf C3K/HeJ (FIG. 
15B) , LDJ/Le-mg (FIG. 15C) , and C3HeB/Fe J -mg 3J (FIG. I5D). 

FIG . 16 . Alignment of the MG protein sequence with its 
2 0 family members showing the transmembrane region (indicated in 
brackets) and cytoplasmic tail (FI*G. 16A) ; and a schematic of 
the molecular modular architecture of MG (FIG 16*B) . 

FIG. 17A-C . Sequence alignment of the predicted MG 
25 protein sequence (top) with the At tract in protein sequence . 
Characteristic MG domains are as indicated. See Section 10.2 
for details. 

FIG. 18A-B. Panel A: cDNA nucleotide sequence (SEQ ID 
3Q NO: 14) of the long splice variant of the human ortholog of 
the mahogany gene, and Panel B: the .derived ami-no a ci<i , 
sequence (SEQ ID NO: 15) of the mahogany -gene product which 
it encodes. 

- 13 - 
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FIG • — 19A ~ E ■ Panel A: cDNA nucleotide sequence (SEQ ID 
NO: 16) of a shorter splice variant of the human ortholog of 
the mahogany gene, and Panel B: the derived amino acid 
sequence (SEQ ID NO: 17) of the mahogany gene product which 

c, 2 r encodes . 

FIG. 20A-B. Panel A: cDNA nucleotide sequence (SEQ ID 
NO: 18) of a second shorter splice variant of the human 
ortholog of the mahogany gene, and Panel B: the derived 
io ammo acid sequence (SEQ ID NO: 19) of the mahogany gene 
product which it encodes. 

5 • DETAILED DESCRIPTION OF THE INVENTION 

Described herein is the identification of the novel 
mammalian mahogany (mg) gene, including the human mahogany 
gene, which is involved in the control of mammalian body 
weight. Also described are recombinant mammalian, including 
human mahogany DNA molecules, cloned genes, and degenerate 
variants thereof. The compositions of the present invention 
further include mg gene products (e.g., proteins) that are 
20 encoded by the mg DNA molecules of the invention, and the 
modulation of mg gene expression and/or mg gene product 
activity in the treatment of mammalian body weight disorders, 
including obesity, cachexia, and anorexia. Also described 
herein are antibodies against, mg gene products (e.g., 
25 proteins), or conserved variants or fragments thereof, and 
nucleic acid probes useful for the identification of mg gene 
mutations, and the use of such nucleic acid probes in 
diagnosing mammalian body weight disorders, including 
obesity, cachexia, and anorexia. Further described are 
methods for the use of the mg gene and/or mg gene products in 
the identification of compounds which modulate the activity 
of the mg gene product . 
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5.1. THE MAHOGANY GENE 

The mahogany genes are novel mammalian -genes involved in 
the contrci of body weight. The nucleic acid sequences of 
the mahogany genes, including the murine mahogany gene 
sequences shown in FIGS. 2A, 3B-D, 6A-B, 8A, and 9A, as well 
as allelic variants and homo logs thereof, and human mahogany 
sequences, as shown, e.g. , in FIGS. 10A, 1£A, 19 A and 20A, as 
well as allelic variants and homo logs thereof. The genomic 
sequence and structure, i.e., the intron/exon -structure, of 
the mahogany genes have also been elucidated, FI*G. 3. 

The mahogany gene nucleic acid molecules of the present 
invention comprise: (a) the DNA sequence shown in FIGS . 2A, 
3, 6A-B, 8A, 9A, 10A, 18A, . 19A or 2 OA, or any DNA sequence 
that encodes the amino acid sequence of the mahogany gene 
product shown in FIGS. 2B, 8B, 9B, 10B, 17, 18B, 19B or 2-OB; 

(b) nucleotide sequences comprising the novel mahogany 
sequences disclosed herein that encode mutants of the 
mahogany gene product in which all or a part of one or -more 
of the domains is deleted or altered, as shown, e . g . , FIG. £; 

(c) nucleotide sequences that encode fusion proteins 
comprising a mahogany gene product, or one of its domains 
fused to a heterologous polypeptide; and (d) nucleotide 
sequences within a mahogany gene, nucleotide sequences on the 
chromosome flanking the mahogany gene , see, e . <? . . FIG. 3 and 
human genomic sequences syntenic to the sequences depicted in 
FIG. 3, which can be utilized as part of the methods of the 
invention for identifying and diagnosing individuals who 
exhibit or are susceptible to weight disorders, including 
obesity, cachexia, and anorexia. 

The mahogany nucleotide sequences -of the invention 
further comprise: (a) any nucleotide sequence that 
hybridizes to the complement of a nucleic acid molecule that 
encodes a mahogany gene product under highly stringent 
conditions , e.g., hybridization to filter-bound DNA in 0 . 5 M 
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- EDTA at 6 5°C, 

Ausubei F.M. e z a: . 
r -oology , Vol . I 



and jonr: Wiiev ^ .<;nnc 



NaHPG 4 , Ik sodium dodecy] sulfate (SDS), 
and washing m C . 2xSSC/C . 1% SD£ at 6£^ 

eds . , 298° i- 

.urre... -Totccois an Molecui 

Green Publishing Associates, inc' 

^ i , w _ tr V <v 

c -inc., New York a- - - n -> " 

- - particularly numan mc 

sequences, FIG u.- = -,^ 

h K . ( ; 3ny nuCJeotice sequence that 

hybnaizes tc the complement of a nuclei - ^ n - - 

o nucie„, ccic molecui e thai 
encodes a mahogany cene product under less st repent 
conditions, such as moderately stringent conditions. 



io e.g., washing m 0.2xSSC/0.l% SDS at 42 L 



(Ausubei et aJ 

±9e9 ' SUPrS) ' enco des a functionally " 

equivalent mahogany gene product. 

"Functionally equivalent", as utilized 

--^-ejLn,. rere vc 

a gene product tec - ■ r ^ 

>e.c., c protein) capable c: exhibitino a 
substantial iv c 1Fl r,-v .v , ■ - 
15 , — ^ V1V ° cavity as the endogenous mc 

gene oroaucts encoder >-•• r^^. 

6C the m 5 ^ene sequences describee 
aoove. The in vivo activity of rh^ m „ 

ty or the m ? 9ene product a- useH 
herein, refers to the ability of' the mn 

cy ot the m 9 gene product, when 
present m an appropriate cell type to - m ~i 

<^j.j. type , CO ameliorate nrevonf 

or delay the appea-ance o- t-H„ u Prevent, 
2ft PPe-ance ox the mahogany phenotype relative to 

20 its appearance when that cell type lack. - «= • 

type lacks = functional 

mahogany gene product . 

The invention a lso includes nucleic a cid molecule, 
pre erably DNA molecules, that are the comple.ents of the 
nucieot.de seouences described above. Am ong the nucl.ic a cid 
25 molecules of the invention a re deoxyoligonudeotides 
< olxso..) which hybridize under highly stringent or 

m m lTr ly d Strin9ent COnditi0 " S '» "» nucleic a cid 

molecules described a bove. Exemplary nighly st ^ 

conditions may refer e a ^„ 

y eier, e.g., to washing in 6xSSC/0 . 05% sodium 

so pyrophosphate at 3 rc «f or 14 . b ase oligos, . „-c (for " base 
019M ' 5B C 2 °- b ™' o»*.). and so'c (for 23 -ba"e 

olrgos, . These nucleic acid nolecul.s m ay encode or act as 
ant.sense .olecules. useful, for ex am p le . in mahogany ^ 
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regulation, and /or as ant i sense primers in ampi i f i ca t i on 
reactions c: ^anoqany gene nucleic acid seauer.ces with 
respect tc mancgany gene regulation, such techniques can be 
used tc regulate, for example, weight disorders sue h as 
5 obesity, cachexia, or anorexic. Such sequences may also be 
used as part c : r ibozyme and /or triple helix sequences, which 
are also useful for mahogany gene regulation. Still further, 
such molecules may be used as components of diagnostic 
methods whereby, for example, the presence of a particular 
mahoganv allele associated with a weight disorder such a c 

0 

obesity, cacnexia, or anorexia, may be detected. Among the 
molecules which can be used for diagnostic methods, such as 
those v/n: ch involve amplification c: genomic mahoganv 
sequences, are primers or probes that can routinely be 
obtained using the genomic and cDNA sequences disclosed 
5 herein. 

In one embodiment, the nucleic acid molecules of the 
invention do not include nucleic acid molecules that consist 
solely of the nucleotide sequence that encodes the attractin 
protein sequence depicted in FIG. 17A-C. 

0 The mahogany nucleic acid sequences of "the invention 

further include fragments of the nucleic acid sequences 
described above. For example, mahogany nucleic acid 
fragments can include fragments of at least 10, 12, 15-, 20, 
30, 40, 50, 100, 150, 200, 300, 400, 500, 600, 70*0, 8O0, ^9O 0 , 
1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 
2000 or more nucleotides. 

The nucleotide sequences of the present invention also 
include (a) DNA vectors that contain any of the foregoing 
mahogany coding sequences and/or their complements; (£>) DNA 
expression vectors that contain any of the foregoing mahogany 

0 coding sequences operatively associated with a regulatory 
element that directs the expression of the <oodi*ig sequences; 
and (c) genetically engineered host -cells and organisms -that 
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contain any of. the foregoing mahogany coding seouences 
operatives, associated with a regulatory element that directs 
the expression of the coding sequence m the host cell As 
usee herein, regulatory elements include, but are not limited 
5 to -.ouciDle and non- inducible promoters, enhancers, 

operators, and other elements known to those skilled in the 
art that drive and regulate gene expression. Such regulatory 
elements include, but are not limited to, the -cytomegalovirus 
hCMV immediate early gene, the early or late promoters of 
o SV40 adenovirus, the lac system, the trp system, the TAC 
system, the TRC system, the major operator and promoter 
regions of phage A, the control regions of fd coat protein 
the promoter for 3 ' -phosphoglycerate kinase, the promoters of 
acid phosphatase, and the promoters of the yeast alpha-mating 
factors. 

in addition to the mahogany gene sequences described 
above, homologs of such sequences, exhibiting extensive 
homology to one or more domains of the mahogany gene product 
can be present in other species. m a preferred embodiment, 
the mahogany gene homologue maps to a chromosomal region that 
as syntenic to the chromosomal region of the mahogany gene 
in a particularly preferred embodiment, a human mahogany gene 
homologue sequence maps to a human chromosome region that is 
syntenic to the region of mouse chromosome 2 to which the 
murine mahogany gene maps, namely 20 P 15, and comprises the 
contiged human MG cDNA provided herein. Further, there can 
also exist homologue genes at other genetic loci within the 
genome of the same species which encode proteins having 
extensive homology to one or more domains of the mahogany 
gene product . Such mahogany homologs can include, for 
example, secreted forms of the mahogany sequences; se € 
Duke-Cohan, J.S. et aJ . (1998, P^oc. Natl, AcaO. Sci. U S A ' 
,5:11336-11341). Such sequences, can be used, for example, 
in the screening assays, described in Section 5.4.2 below, 
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for compounds that interact with the mahogany gene and/or its 
gene product and that may therefore be useful in treating and 
ameliorating body weight disorders. 

Other mahogany homologs can be identified and readily 
5 isolated, without undue experimentation, by molecular 
biological techniques well known in the art, and ar^e 
therefore within the scope of the present invention. As an 
example, in order to clone a human mahogany gene homologue 
using isolated murine mahogany gene sequences, such murine 
mahogany gene sequences may be labeled and used to screen a 

10 

cDNA library constructed from mRNA obtained from appropriate 
cells or tissues derived from the organism (in this -case, 
human) of interest. With respect to the -cloning of -such a 
human mahogany homologue, a human cDNA library may, for 
example be used for screening, such as a cDNA library 

15 obtained from mRNA isolated from brain tissues, particularly 
containing hypothalamic regions. 

The hybridization washing conditions used should *be of a 
lower stringency when the cDNA library is derived from an 
organism different from the type of organism from which the 

2Q labeled sequence was derived. With respect to the -cloning of 
a human mahogany homologue, for example, hybridization can be 
performed for 4 hours at 65°C using Amersham Rapid Hy-b™ 
buffer (Cat. #RPN1639) according to manufacturer's protocol, 
followed by washing, with a final washing stringency of 
i.0xSSC/0.1% SDS at 50 °C for 2 0 minutes being preferred. 

25 

Low stringency conditions are well known to those of 
skill in the art, and will vary predictably depending on the 
specific organisms from which the library and the labeled 
sequences are derived. For guidance regarding such 
conditions see, for example, Sambrook et al. t 19€9, Molecular 
30 Cloning, A Laboratory Manual, Cold Springs Harbor Press, 
N.Y.; and Ausubel et al . , 1989, Current Protocols in 
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Molecular Biology, Green Publ i sh v rc * CSC v--r~ 

/isso^iates ana Wi ■ ev 

Interscience, N.Y. 

5 ' tn ^ I3Ceiec rragmer.t may be usee tc 

screen a genomic librarv de^ve^ i-v-^ 

c--\e_ ^ 0ll . tne organism of 

5 interest, again, usmc aDproori ate > v <=-■,--■ ™« 

- LC "-fc-i v E^rjngen: . cone- t ions . 
Furtner, a mahoqanv ae^e h-^'nr-i ^^..-^ 

* ' - * e nwmoJo ? ue may oe isolated *^ on 
nucleic acid of the organism of ,nte-e- h- * " " 

iLeres - by performmc PCF 
using two aegenerate ciioonuc'e^inp r*~-r- - ' 

r , i . " " e ^ iQe Primer pools desianed on 

the basis oi. amino acid sequences withi- th~ ra -hoo-n " 
proaucr cisciosec herenr The - - " " 

10 h<i rnB , ■ . ^ tem - D ^ t& i or the reaction may 

. be cDN. ootamec by reverse transcription of m^A preDared ' 
from, for example, human or non-human ceil l ines or t " issue 

known or susnecrp^ - r> cvt^v-^^^ 

• 6y cG ex Press a manogany gene allele 

that T PCP \ Pr ° dUCt ^ ^ SUbCl ° ned Snd -quenced to ensure 
that tne amplified sequences represent the sequences . of a 
mahogany gene nucleic acid sequence. The PCR f ragment may 
then be used to isolate a full l ength cDNA clone fay a 
of methoos. For example, the amplified fragment may be 
labeled and used to screen a cDNA library, such as a 
bacteriophage cDNA library. Alternatively, the labeled 
20 fragment may be used to isolate genomic clones via the 

screening of a genomic library. This method has been used to 
isolate sequences encoding each of the murine MG gene exons 
as well as to isolate contigs containing the human MG 
sequences provided herein, FIG. io. 
25 PCR technology may also be utilized to isolate full 

length cDNA sequences. For example. RNA may be isolated, 
follows standard procedures, from an appropriate cellular 
or tissue source (i.e.. one Known, or suspected, -to express 
the mahogany gene) . A reverse transcription faction may be 
performed on the RNA using an oligonucleotide primer specific 
for the most 5 ■ end of the amplified fragment for the priming 
of the frrst strand synthesis. The resulting RNA/DNA hybrid 
-y then be ..tailed., with guanines using a standard terminal 
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transferase reaction, they .hybrid may be digested with RNAase 
K. and second strand, synthesis may then be primed with a 
poiy-c primer. Thus, cDNA sequences upstream of the 
amplified fragment may easily be isolated. For a review of 
5 cicninc strategies whach may be used, see e.g., Sambrook « 
al . , 1985 supra. 

Mahogany gene sequences may additionally be used to 
isolate mutant mahogany alleles. Such mutant alleles may be 
isolated from individuals either known or proposed to have a 

Q phenotype which contributes to the symptoms of body weight 
disorders such as obesity, cachexia, or anorexia or disorders 
associated with hyperphagia. Mutant alleles and mutant 
allele products may then be utilized in the therapeutic and 
diagnostic systems described below. Additionally, such 
mahogany gene sequences can be used to detect mahogany .aene 

> regulatory (e.g. promoter) defects which can affect body 
weight . 

A cDNA of a mutant mahogany gene may be isolated, for 
example, by using PGR, a technique which is well known to 
those of skill in the art. In this case, the first cDNA 
strand may be synthesized by hybridizing an oli-go-dT 
oligonucleotide to mRNA isolated from tissue known or 
suspected to be expressed in an individual putatively 
carrying the mutant mahogany allele, and by extending the new 
strand with reverse transcriptase. The second strand of the 
cDNA is then synthesized using an oligonucleotide that 
hybridizes specifically to the 5- -end of the normal ,gene . 
Using these two primers, the product is then amplified via 
PCR, cloned into a suitable vector, and subjected to DNA 
sequence analysis through methods well known to those of 
skill in the art. By comparing the DNA sequence of the 
mutant mahogany allele to that of the normal mahogany allele, 
the mutation (s) responsible for the loss of alteration of 
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activity c: che mutant mahogany gene product ^ 
ascertained. " ~" 

Alternatively, a ' genomi c lib-a-- c -r k 

using DNA obtained from an in--.-: Q u-^ 

,,>< - ,jQUe: suspected of ^ i, n ~._ 
5 co carry tne mutant mahoganv allege c ~ - mm - ~~ 

constructed using R* A from £ ti£cue ^ ^ c « 
express tne mutant mahooanv al , eife t he ' " ^ 
or any suitable fragment" there^ X l h ™ ^ 
as Vorobe tc identify th. ' '" ^ and use = 

allele is such libraries ? m ^any 

10 ^ anes ' Clones ccntaminc the m U -n^ 

mahoqanv aene =eouenr P c ™- - 1T - 1 '-=nt . 

- - - sequences may tnen oe our^Pri ■ 

sequence anaivsis acc-rc<n~ - u ^ cnC ^ected to 

• .-- ' oCC ~ r c^ , c methods well known t~ --^ 

sKiu ir. tne art. r c ^ -nose ci 

Additionally, an expressior ^ c ^-^' ■ 

ut2i i2in c cDNA synthesized f >- Q r ^ „ - ^ l6G 
5 * ' . --om, x0 r example, RN£ -, — - r - 

T * " ssue kno -- » -p«« e , to expre „ . mut ;;r° tec 

~i.og.ny .U. S . ln .„ lndjvldua; su>pected of 

carry such a mutant allele ^ t-v, tC 

screened u S1 n 9 stan dard antibody Z.Z^T'"^ ^ 

mahog.ny gene product a s d es C r ib e d , be " \ 
p^ ~ eiOW ' -^n Section R *v 

^ ECreeC1 " 9 S ee, £or example , Ha ^ 

Lane, eos.. 1S8 8. "Antibodies: A Labor a tory M a nu a l" cold 
Spring Harbor Press. Cold Spring H a rbor Z , i 
mahog any mutation results in * Where a 

altered Action - T T""" ^ 

,„,„..,. 9 -' SS ° result °f amissense or a 

framesh.f. mutation) a polyclonal set of anti ma H 
Product antibodies a re like l y to cross-L c CI Tl7 ^ 
mahogany gene product. Library clon£s ^^T ™ 
reaction witn sucn labeled antibodies can be purl ed I I 
subjected to sequence analvcie P led ana 

-~n. t o tnose It ^ ^ mSth0dS 
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5.2. PROTEIN PRODUCTS OF THE MAHOGANY GENE 

Mahogany gene products (e.g., proteins;, polypeptides 
and peptide fragments, mutant, truncated, cr deleted forms o: 
the mahogany gene product, and/or fusion proteins cf the 
5 mahogany gene product can be prepared for a variety of uses. 
For example, such gene products, or peptide fragments 
thereof, can be used for the generation of antibodies in 
diagnostic assays, or for the identification of other 
cellular or extracellular products involved in the regulation 
of mammalian bodv weiqht . 

10 

Mahogany gene products, also referred to herein as 
mahogany proteins, of the present invention include those 
gene products encoded by the mahogany gene sequences 
described m Section E.l, above. For example, FIG. 2E , 8E 
and 9E depict murine mahogany amino acid sequences. Mahogany 
gene products also include human mahogany gene products as 
shown, e.g. . in FIGS. 10E, I7E, 18E, 1SE, and 20B. 

In addition, mahogany gene products may include proteins 
that represent functionally equivalent gene products. Such 
an equivalent mahogany gene product may contain deletions, 
20 including internal deletions, additions, including additions 
yielding fusion proteins, or substitutions of amino acid 
residues within and/or adjacent to the amino acid sequence 
encoded by the mahogany gene sequences described, in Section 
5.1, above, but that result in a "silent" change, in that the 
change produces a functionally equivalent mahogany gene 

2 5 

product. Such amino acid substitutions may be made on the 
basis of similarity in polarity, charge, solubility, 
hydrophobicity , hydrophilicity , and/or the amphipatfaic nature 
of the residues involved. For example, nonpolar 
(hydrophobic) amino acids include alanine, leucine, 
30 isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, 
serine, threonine, cysteine, tyrosine, asparagine, and 
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glucamine; positively- charged (basic) amino acids mci-ude 

arqmine , ivsine an^ v, -; 0 ^ 

=n~ '"S^^ne; ana negatively charged 

<SCld:C) amin ° 3CidS inClude "Partic acid and glutamic acid 
"Functionally equivalent", as utilized her-, _ fe _ L. 
5 a gene product (e.g., a protein) capable Qf exhlbiting a ° " 
substantially similar in vivo activity as the 
gene products encoded by the ,g gene sequences described in 
Section 5.2, above. The in vivo activity of the mg gene 
product, as used herein, refers to the ability of the m 9 gene 
10 product, when present in an appropriate . cell type to 

ameliorate, prevent, or delay the appearance of the mahogany 
phenotype relative to its appearance when that cell type 
lacks a functional mahogany gene product. 

Alternatively, where alteration of function is deseed 
■ deletion or non- conservative alterations can produce altered 
mcludang reduced-activity, mahogany gene products. Such 
alterations can, for example, alter one or more of the 
biological functions of the mahogany gene product. Further 
such alterations can be selected so as to generate mahogany' 
gene products that are better suited for expression, scale 
20 up, etc. in the host cells chosen. For example, cysteine ■ 
residues can be deleted or substituted with another amine 
acid residue in order to eliminate disulfide bridges 

As another example, altered mahogany gene products can 
be engineered that correspond to mutants or variants of the 
25 mahogany gene product associated with mammalian weight 

disorders, such as obesity, cachexia, or anorexia. Altered 
mahogany g ene products can also be engineered that correspond 
to mutants or variant*? of t- u~ „,_w~ 

riants of the mahogany gene product known to 

neutralize or ameliorate the symptoms of body weight 

30 IT'T; " 0bCSity ' CaCheXia ' which are 

mediated by some other gene, including, but not limited to 

body weight disorders mediated by the agouti gene. 
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Also within the scope of the present invention are 
peptides and/cr proteins corresponding to one or more domains 
of the mahogany protein or any one of the individual exon 
encoded regions of* the MG protein, as well as fusion proteins 
5 in which rhe full length mahogany protein, a mahogany 

peptide, or a truncated mahogany protein or peptide is fused 
to an unrelated heterologous protein. Such proteins and 
peptides can be designed on the basis of the mahogany 
nucleotide sequence disclosed in Section 5.1, above, and/cr 
on the basis of the mahogany amino acid sequence disclosed in 

10 

the Section. 

The mahogany gene products of the invention further 
include fragments of the gene products described herein. For 
example, mahogany gene product fragments can include 
fragments of at least 10, 12, 15, 20, 30, 40, 5*0, 1O0, 150, 

15 200, 300, 400, 500, 600, 700, 800, 9O0 , 100O, 110*0, 1200, 
1300 or more amino acids in length. 

In one embodiment, it is understood that the gene 
products of the present invention do not include a --gene 
product that consists solely of the amino acid sequence of 

20 the attractin polypeptide depicted in FTG. 17. 

Fusion proteins of the invention include, but are not 
limited to, IgFc fusions which stabilize the mahogany protein 
or peptide and prolong half life in vivo; or fusions to any 
amino acid sequence that allows the fusion protein to be 
anchored to the cell membrane; or fusions to an enzyme, 

25 

fluorescent protein, or luminescent protein whioh provides a 
marker function . 

The mahogany gene products, pepti<3e fragments thereof 
and fusion proteins thereof, may be produced hy recombinant 
DNA technology using techniques well known in the art. Thus, 
30 methods for preparing the mahogany gene products, 
polypeptides, peptides, fusion peptide and fusion 
polypeptides of the invention -by expressing nucleic acid 
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containing mahogany gene sequences are described h • 

Methods that are well how, t- • herei "- 

wen xnown to tnose s w i -i <^ - r- 

be used to construct express.on vecto^o^T " ^ 

gene product codino seauences and COn< ™g mah ogany 

. and -ans-t^or^ appropriate transcriptional 

3 -i.j ^ons iB tionai control siqnal<; T h« " 

, 9 ais - These methods include 

example, the tecnn lqU es described ln sa 

supra, and Ausubel, etal 19SQ „ ,1 " 198S ' 

,„ capable of encoding Alt «nativel y , r„ a 

10 c hemi .a lly syn :::i r 9 ;:; rrr s — - - 

Oligonucleotide Synthesis", 1984 Ga^ „ '■ 
Oxford. ' Gait ' ed " ^ Press, 

» utiliCerr"" ^ hOSt - e ™ i - sector systems may be 
^ zed to express the mahogany gene product cod J 
sequences of the invention Such hn^ 

— — s by „ hich t :r vzi'zzzt ; ystems 

«y be prodded and subseauentl y purged b 

ceils that ^ when ^o^Z^^^Z:^ 
20 appropriate nucleotide codino seouences. exhibit n I 

gene, product of the invention in sicu T T S T ^ 

not iimited to m icroor g anisms such ^ ^ f 

« ™— — -in S :e;ii s :; n e ii;rr cainin9 

Saccharoses. Plchia) transformed with recombinant ' yeast 
expression vectors containinc, the m»>,„ 

coding secuences; i nsec t ce TsZT 7 ^ 

. ■ systems infected with 

recombinant virus e>-r»-o<==, • 

v^orT;:: T c rr th — :~ plant 

vectors (e.g., cauliflower mosaic virus CaMV- roH 

virus, ™v, or transfers „i th ^^^^^ 
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sion vectors (e.g., Ti plasmid) containing mahogany ^ene 
product coding sequences; or mammalian cell systems (e o 
COS, CHO, BHK, 293, 3T3) harboring recombinant expression' 
constructs containing promoters derived from the genome of 
5 mammalian cells (e.g., metallothionein promoter) or from 
mammalian viruses (e.g., the adenovirus late promoter; the 
vaccinia virus 7 . 5K promoter). 

In bacterial systems, a number of expression vectors may 
be advantageously selected depending upon the use intended 
10 for the mahogany gene product being expressed. For examole, 
when a large quantity of such a protein is to -be produced, 
for the generation of pharmaceutical compositions of mahogany 
gene product or for raising antibodies to mahogany gene 
product, for example, vectors that di-rect the expression of 
high levels of fusion protein products that are readily 
purified may be desirable. Such vectors include, but are not 
limited, to the E. coli expression vector P UR278 (Ruther et 
al., 1983, EMBO J. 2, 1791), in which the mahogany gene 
product coding sequence may be ligated individual ly^into the 
vector in frame with the lac Z coding region so that a fusion 
20 protein is produced; plN vectors (Inouye and Inouye, 19€5 
Nucleic Acids Res. 13, 3101-3109; Van Heeke and Schuster,' 
1989, J. Biol. Chem. 264, 5503-5509); and the like. pGEX 
vectors may also be used to express foreign polypeptides as 
fusion proteins with glutathione S- transferase «3ST) . m 
25 general , such fusion proteins are soluble and ^an easily be 
purified from lysed cells by adsorption to glutathione- 
agarose beads followed by elution in the presence of free 
glutathione. The pGEX vectors are designed to include 
thrombin or factor Xa protease cleavage sites so that the 
cloned target gene product can be released from the *GST 



30 

moiety. 



In an insect system, Autographs California, -nuclear 
polyhidrosis virus (AcNPV) is used as a vector -to express 
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foreign aenes . Th<=> >- llf -, 

cell. ^ e ^ S 9r ° WS in Copter, frv giperda 

• The m °hogany gene product cori-i™ 
cloned individuals i m- ^mg sequence may be 

Y ° non -essential regions (for 
the polyhedrin gene) of rh. - example 
c - thS Vlrus and Placed under ~ 0 ~ rrl - 

Successfui J s J ti lTl T P ° lyhedrin P ~> • 

-«i r.„ lt ln rL cti ; at r ° 9 ; ny 9ene pr ° duct ~ e 

lading the proteinaceous coat ^7 T' ^ 

» u . - ; t r:„::;: e r g infe " 

-prised. see Smith. . -"-^ gene is 

584 ' Smith ' — t No . ,. 2a5 .osu VlrCl - «' 

in mammalian host cells, a number of viral b.. „ 

expres.icn S ystem e may be utili.ed fVlral - faased 

15 ^ , _ * In cases wherp 

adenovirus is iiq^h iere ar * 

USed as an expression vector th* ^ 

adenovirus transcription/translation /'^^ to *» 
the late promoter and tripartite ! , <™>Plex, e. s ., 

chimeric gene may then be"! e t^eT"" ' ™" 
,0 by in vitro or in vivo recombinat o ^ 
essential region of the viral genome (e " * n0 "" 

""I result in . recombinant virus th a t r ' 91 ° n ^ " 

required for efficient traLla "n of " * 
product coding sequences ^"""V °* ltU ™* ™*°3™Y S«ne. . 
inJH W ences - These signals include the Aiy 

initiation ccdon and adjacent sequences i„ e Cn e AT<3 
entire mahogany gene, including t « 

adjacent sequences i, initiation codon and 

i n yuences, i S inserted int-o t-v,~ 

S ° expression vector no ^- appropriate 

portion, of the mahogany g r" L d ■ 1 ^ " he " ^ 3 

9«ny gene coding sequence is inserted. 
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exogenous t ransi at i ona j conf-e'' e-o*-,--<c 

^ojjl.c. t.q.ic; £ , inciucmg, cernaps, 

the ATG initiation codon, must be provided. Furthermore / the 
initiation codon must be in phase with the reading frame of ^ 
the desired coding sequence to ensure translation/: the 
5 entire insert. These exogenous translation! centre: sionais 
and initiation codons can be of a variety of origins, both 
natural and synthetic. The efficiency of expression mav be 
enhanced by tne inclusion of appropriate transcription 
enhancer elements, transcription terminators, -etc', (see 
10 Bittner ' er a2 '- 1987 < Methods m Enzymoi . 153, 516-544). 

in addition, a host ceil strain may be chosen that 
modulates the expression of the inserted sequences, or 
modifies and processes the gene product m the specific 
fashion desired. Such modifications (e.g., cl vcosvi at i on ■ 
and processing ,e.c„ cleavage) of protein products mav be 
important for the function of the protein. Different host 
cells have characteristic and specific mechanisms for the 
post-translational processing and modification of proteins 
and gene products. Appropriate cell lines or -host systems 
can be chosen to ensure the correct modification and 
20 processing of the foreign protein expressed. To this -end 

eukaryotic host cells that possess the cellular machinery ' for 
proper processing of the primary transcript, glycosylate, 
and phosphorylation of the gene product may be used. Such 
mammalian host cells include but are not limited to CHO, 
2s VERO, BHK, HeLa , COS, MDCK, 293, 3T3 , and WI38. 

For long-term, high-yield production of -recombinant 
proteins, stable expression is preferred. For example, cell 
lines that stably express the mahogany gene product .may be 
engineered. Rather than using expression vectors that 
contain viral origins of replication, host cell* -can be 
30 transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer, -sequences, 
transcription terminators, polyadenylation sites, ,et</ ) . and 
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; Selectab ^ -arker. Follow! no the , nCrod , 
foreign DNA , engineered cell, 1 ! ^ ntrod "c ta on of the 

CL ceils may be (A ^ 

a *ys in an enriched media anH ^ a ^° wea to. grow f or 2 . 2 
selective me d i5 The ^ * re itched- to a 

. .as,, ^ S6leCtable ma ^- in the r ^ T .,__ nt 

. -I..-- ^ JUC rs resistance to the - — ^«ant 

to stabiy iMe9 "» t ta Plasmid in ; 0 ei ; h ": r on ch and aii °- s «»■ 

grow to form foci that in chromosomes and 

«U i lnM . This "™ " n be cl °n*d and expanded 

— «» lines ths : ::;:zv h TT eousiy be u ~ d - 

■10 £UCh en91 ™ ~» "nes may be partToXT " 
screening and evaluation o- ™ r ^="larly use£ul in 

endogenous activity of t „ ^ a " e « «« 

A number of sele ct ^ 9 "°" 

»== Umited to the »~ P 77 sZ7777 77Zs7 

'Wi 9 i.r. « a ,. 1977> celi f '™ne kinase 

" P^phoribosy! transferase ,s 2V bal,v hyP ° Xanthi "*-9uanine 

«— • n«i. AC ad. sci. 0SA "v* j * ,i,w ' 

Pnospnoribosyl transferase (howy 7 a 7 77^ 

9enes can be employed i„ tk - " * J " ' 198 °' Ce » M. 8:7, 

respectively. Also , nr ■ \ ° r »P«- cell., 

aiso, antimetabolite 
20 the basis of selection for the f o, ! 1StanCe " n ««d as 
confers resistance t. whot J^ 9 -'. -ich 
Natl. Acad. sci. usa 77 (Kl 9 J ", e t aj . , 19B0 , 

to mycophenolic acid <Mulli M ' T.1 ""^ C ° n£ers ^"stance 
26 Acad. sci. USA ,8 2 ol7 7 9 ' 1981 - PrOC ' 

the aminoglycoside c~4 le <77i C ° n£ers "distance to 

«ol. Biol. 150 , 1) . and h berre " <3arapin - «-i..Vl.8i. j 
hygromycin <Sa n t re — »~ «> 

Alternatively che ' ~~ 3 °" »') • 

eiy ' tne expression chara^ 

0 end09an ° US * ah °^ gene „ithin a cell ° f " 

my be modified by inserting a heteroll " 
element into the genome of a staoll ceT^ 
microorganism such thar ft • 6 ° r clon ed 

" lnS " Ced "gulatory element is 
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operativejy linked with the endogenous mahogany gene. For 
example, an endogenous mahogany gene which is normally 
"transcriptionally silent", i.e., a mahogany gene which is 
normally not expressed, or is expressed only a very low 
5 levels in e cell line or microorganism, may be activated by 
inserting a regulatory element which is capable of promoting 
the expression of a normally expressed gene product in that 
cell line or microorganism. Alternatively, a 
transcriptionally silent, endogenous mahogany bene may be 
o activated by insertion of a promiscuous regulatory element 
that works across cell types. 

A heterologous regulatory element may be inserted into a 
stable cell line or cloned microorganism, such it is 
operatively linked with an endogenous mahogany gene, using 
techniques, such as targeted homologous recombination, which 
' are well known to those of skill in the art, and described 
e.g., in Chappel , U.S. Patent No. 4,215,051; U.S. Patent No. 
5,578,461 to Sherwin et al . ; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al . ; and 
International Application No. PCT/US90/06436 (WO91/06€€7) by 
Skoultchi et al., each of which is incorporated by reference 
herein in its entirety. 

Alternatively, any fusion protein may be readily 
purified by utilizing an antibody specific -for the fusion 
protein being expressed. For example, a system described by 
Janknecht, et al . allows for the ready purification of non- 
denatured fusion proteins expressed in human cell lines 
(Janknecht, et al . , 1991, Proc. Natl. Acad. Sci . USA Be, 
8972-8976). in this system, the gene of interest is 
subcloned into a vaccinia recombination plasmid such that the 
gene's open reading frame is translationally fused to an 
amino- terminal tag consisting of six histidine residues. 
Extracts from cells infected with recombinant vaccinia virus 
are loaded onto Ni>* • nitriloacetic acid- agarose columns and 
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histidane-tagged proteins are . ejec 

iniidazoie-containinc buffers. ~ - Wlth 
The mahogany gene products can a: 



transgenic animals" A, lma -, c ^ ^ - eXpres£e ^ * 

»<* limiceo to , mice> ^ SP6Cies ' -eluding, 



out 



micro-pig Ef goats, sheep -J™"*' P19S ' pl9£ ' 

baboons, monkeys - -on-human primates, e.c 

nKeys - a "° cnimoanzee- u 
■»hog.„v c,an sgenlc animaiE £ ™> ^ ««» « S . n .„ te 

i *~ * * .Tier Le>*m " -v — _ 

herein, refers to animals expre^ • — < ns 9enic," a£ U£ed 
10 ^om a different specie- (e c '"^ 9«e sequences 

mahogany gene sequences; ae" We1 ^ eXpressin 9 human, 
genetically engineered to ove^ " ^ h3Ve b — 

species; mahogany seouence* o^ - n Venous (i.e., same 

genetically engineered to noion^ ^ ^ ^ 

mahogany gene seouence- ( • ^ eXpr6S£ exogenous 

15 their progeny. " *" " knoc *-°^" animals), and 

Any technique known in the art ' 
a mahogany gene transcene into „ • y USed t0 introduce 
—s of transgenic anUs ^T h^ ^ fOUnd ~ 

are not limited to .pro»ucle.r J techni *»" include, but 
2 0 Wagner, 1989, u S P a t M ^roin section (Hoppe : and 

w.o. Pat. No. 4, 873,191). 
9ene transfer i nto 9er m l ines (Vari ' ^ovarus mediated 
-roc. Nat l. Acad . Sci>< USA e 8 S 2 ^^f 1 ^ PUtt — t a,., 1985 , 
embryonic stem cells (Thompson' et ai" 6 ^ ^ «■ 
321), electroporation of embryos " ' ^ 56 ' 3l3 " 

2S 3, -3- 18l4); and sperm-meta" d gelT 3 ' ^ ' 

■ . 1989, cell 57, 717-723) ' 9 * < L -itrano et 

techniques, .ee Gordon, i 989 TraV ^ ^ 

Cytol. 115, 171 . 229) 9 ' Trans 9^ic Animals, i ntl . Rev 

Any technique known in the art- 
3o transgenic animal clones containing T'J* '° 

example, nuclear transfer into ehucieated ^ tr "-?^' f ~ 

■ from cultured embryonic fetjll Cle3t€d «>cytes of nuclei 

' fStal ° r adult «lls induced to 
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quiescence '.Campbell, et al . , 1996., Nature 360, 64 -6£; 
Wiimut, er as., Nature 265, £10-613). 

The present invention provides for transgenic animals 
chat carry a mahogany transgene m all their cells, as we"> " 
5 as animals that carry the transgene in some, but net all 
their cells, i.e., mosaic animals. The transgene may ce 
integrated as a single transgene or in cencatamers, e.c, 
head-to-head tandems or head-to-taii tandems. The transgene 
may also be selectively introduced into and activat.ee in a 
10 particular ceil type by following, for example, the teachino 
of Lasko et al . (Laskc, et al . , 1 992, Proc . Natl. Acad. Sc -: " 
USA 89, 6232-6236). The regulatory sequences required for"" 
such a cell -type specific activation will depend "upon the 
particular ceil type o: interest, and will be apparent to 
^ those of sKii: in the art. When it is desired that the 
- mahogany transgene be integrated into the chromosomal site of 
the endogenous mahogany gene, gene targeting is preferred. 
Briefly, when such a technique is to be utilized, vectors 
containing some nucleotide sequences homologous to the 
endogenous mahogany gene are designed for the purpose of 
20 integrating, via homologous recombination with chromosomal 
sequences, into and disrupting the function of the nucleotide 
sequence of the endogenous mahogany gene. The transgene may 
also be selectively introduced into a particular cell type, 
thus inactivating the endogenous mahogany gene in only that 
25 cell type, by following, for example, the teaching of Gu, et 
al. (Gu, et al.. 1994, Science 265, 103-106). The regulatory 
sequences required for such a cell-type specific ina-ct ivation 
will depend upon the particular cell type of interest, and 
will be apparent to those of skill in the art. 

Once transgenic animals have been generated, the 
expression of the recombinant mahogany gene may oe assayed 
utilizing standard techniques.' Initial screening may -be 
accomplished by Southern blot analysis or PCR ■techniques to 
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analyze animal tissue. ^ 

—gene has tsken ^ : h " her of the 

the transgene ln the ^^^^ ^ o f ^ A expression o£ 
aiso be assessed us.ng techniaues ^ "T^ 

Horcnern blot =n-i, re noc 

f ™ ^ in j ° f ««u. sa mpl es obtained 

~= «■«..: «y -,^e maho9any 9ene - 

USln 9 antibodies specific for th lmmun °^tochemic a l iv 
10 mahogany transgene product' 

■ ein ar e methods for t-u 
antibodies capable of soecificaliv Paction of 

gene product epitopes, or eoiton reC ° 9 " i2in 9 <»» or „ e ma 
peptide fragments of the J COnserv * d variants, or" 

1 Vantibodies that specif cal T v 9ene Pr ° dUCtS - ^" urtn er , . 
9ene products, are ene^J ™™ ^ « * 

Such antibodies may include y ' ! lnVentlOT ' 
Polyclonal antibodies, monoclone, ant h^ "" """^ C °' 
humanized or chimeric antibort antib ° d «s <mAbs, , 

20 Fab fragments, K( . b .,, f ^J"" ^ ^ antibodies, 
expression library, anti-idiotyp-J"^^,^^ ^ a Pab 
epitope-binding fragments of £ of the H "* and 

antibodies may be used f„ ab ° Ve - Such 

used, f or example in h k. ^ 

9ene product in an biological saLe Y ^" ion <* a „ 9 
25 util i2 ed as part of a diagnostic^ ""^ * 

"hereby patients may be tested f '" W " lc ^hnique 
products, and/or for the prl^Y^ 1 ^ °* * »- 
gene products. Such antibodie^ abno ""al forms of such 

conjunction „i th . for exalT ^ ^ * in 
^escribed, belo W , in ~« ~9 schemes, as 

effect of test compounds on m ' aan ' * SValu »"°" of the 

activity. Additionally Tl a ^ 

conjunction with .he gene th L «— « 

9ene therapy techniques described 
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10 



below, m Section 5.4.3.2, to, for example, evaluate the 
normai and/or engineered mahogany- expressing cells prior to 
their introduction into the patient. 

Anti-mg gene product antibodies may additionally be used 
111 meChods for inhibiting abnormal mg gene product -activity 
Thus, such antibodies may, therefore, be utilised as part of 
weight disorder treatment methods. 

For the production of antibodies against a mg gene 
product, various host animals may -be immunized by injection 
with a ma gene product, or a portion thereof. Such host 
animals may include, but are not limited to rabbits, mice 
and rats, to name but a few. Various adjuvants <nay be used 
to increase the immunological response, depending on the host 
species, including but not limited to Freund's (complete and 
incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecit-hin, pluronic polvols 
polyanions, peptides, oil emulsions, keyhole limpet * 
hemocyanin, dini trophenol . and potentially useful human 
adjuvants such as BCG (bacilie Calmette-Guerin) and 
Corynebac Cerium parvum. 

20 Polyclonal antibodies are heterogeneous populations of 

antibody molecules derived from the sera of animals immunized 
with an antigen, such as a mg gene product, or an antigenic 
functional derivative thereof. For the production of 
polyclonal antibodies, host animals such as those described 
25 above, may be immunized by injection with mg gene -product 
supplemented with adjuvants as also described above. 

Monoclonal antibodies, which are homogeneous populations 
of antibodies to a particular antigen, may -be obtained by any 
technique that provides for the production of antibody 
^ molecules by continuous cell lines in culture. These 

include, but are not limited to, the hybridoma teehnioue of 
Kohler and Milstein, (1975, Nature 256, 495-497; and -U S 
Patent No. 4.376,110), the human B-cell hyfcridoma technic 
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(Kosbcr et a2 '-op-- . 

= • «Cqc. Sc:. US a cr _ 

^ybridoma techmoue (Cole et ^ e '~ c "^ >30) - and the EB V - 

And Cancer Therapy, Aiar r . T °" " J^' *° noca °^i Antibecue* 

^M, IgE, JpA Tgri . _ " 0t>u - li - CIaS£ anciudmo JcG 

-9^ ana any subclass ^pr an - " 
producing the mAb o* t^ e - The ^ridoma 

--o or vivo ; ro v:; t ™ ion ^ - - itivated ir 

-roauction of hn a v. ^ 

makes this ^hp - ^ " cers of mAbs v r , 

. e.en.jy preferred method r- ^ - 
10 In addition -terh-s«, - " 1 production. 

Chlmenc «tibodi ee « (Morrasor e / " ' proauc ^°n of 

Ac ad. £ca ., B1 , W. 685e M ^ ° J " 1984 ' ^c. Natl. ■ 

6^55; Neuperae- 
312, 604-60E; Takeda e , --, . ~ ' ' 1984 ' Nature 



- et a j . , 19B5 - 

■splicing the genes „orr, - m 314 ' 4 52-454) 

■ i£ appropriate sntigen ^ ^ <* 

human antibody of t0 f et "« «*th genes f rom a 

canbeused. A .^^^ ^ 
Afferent porcions are der ^IfT ^ 

SUCh 3S th °- ^ving a variable Zi" ^ 
;«Ab and a human immunoglobulin co ^ * mUrine 

20 Cabilly et al. u s Z re9ion " (See, e a 

u.s. Patent No 4 o u : 

»-S. Patent No. «, ei6397 , „ hich a '!"' 567; " d « al . . 

; - fere „ce in their entirety h ; Ch ™ — rporated herein by 

In addition, techniques have been dev , 
^eduction of hum ani 2 ed antibodies L e T 
25 Patent No. 5.565,089. „ hic h is ino ' " ° Ueen - "-S. 

reference in its entirety f """^ated —in by 
cnain vari aMe region con^^of T^T 1 " " h — 

interrupted by three hypervariabl. £rame "° rt " region 
complementarilv determinin= re 9 10 ns, referred to as 

- fences of Proteins of t^^^ ^ 
^at)at, E. et al n q yicai Interest" 

services (19 33,."' o f * slth and ^ ' 

^ hUmaniMd -«^ies are antibody 
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molecule, from non-human specie, having one or more CDRs fr om 
tn. nor, human species and a framework region from a human 

immunoglobulin molecule. 

Alternatively, techniques described for the production 
5 of sanaie chain antibodies (U.S. Patent 4,946,778- Bird 

1986. Science 242, 423-426; Huston, et al . , 1988 , Proc .' Natl 
Acac. Sc. USA 85, 5879-5883; and Ward, et al . , 19 89, Nature 
334, 544-546) can be adapted to produce single chain 
antiboaies against mahogany gene products. Single chain 
iQ antibodies are formed by linking the heavy and light chain 
fragments of the Fv region via an amino acid bridge 
resulting in a single chain polypeptide. 

Antibody fragments that recognize specific epitooes may 
be generated by known techniques. For examole, such * 
fragments include but are not limited to: the F(ab') 
fragments, which can be produced by pepsin digestion' of the 
antibody molecule and the Fab fragments, which can *e 
generated by reducing the disulfide bridges of the F<ab.) 
fragments. Alternatively, Fa b expression libraries may be 
constructed (Huse, et al . , 198 9, Science, 246, 1275-1281) to 
20 allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 

5-4. USES OF THE MAHOGANY GENES, 

GENE PRODUCTS. Awn ^NTTRODTttc 

^ Described herein are various applications of the 

mahogany genes, of the mahogany g €ne products, including 

peptide fragments thereof, and of antibodies directed against 

mahogany gene products and peptide fragments thereof Such 

applications include, for example, prognostic and diagnostic 

evaluation of body weight disorders and the identification 

30 subjects with a predisposition to such disorders as 

described below, in Section 5.4.1. Additionally, -such 

applications include methods for the treatment of body weight 
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and faocv we -a'— — ^ - 

5 -«-. and fcr . — Dec - ^i.ow, ir. Section 



the exDress:,r ~* . . " " -ompounds which moa^a^ 

the mahogany aene rroduc- -_ 0 * u ' ;e ««ivity of 

he *i ^s,.. . . • UC ^' of ^escribed ir. Se — = - - 

— ,ucr. compounds car. , n .-,„ de ™" ~ " — " < 



cellular products wh^ c v -^/"^ ' 1Cr **«nipie, other 

regulation. " oa -' w eaght. 

--ese compounds can be use- < „ 
amelioration c * n^,, • ■ • r exam cie, - - th . 

* DO °y ^ight disorder- , nr1 ,^- 
cachexia, and anorexia . •' . " ncJjCin ? obesity, 

Whi ' 6 ' f ° r ci «» yi .the uses dec C .- b - • 
are primarily uses rented * c b - ^ thl * s ^ion 

abnormalizes, iE tc ^ ^ 6i8 °**** 

and. therapeutic treatment! 11^^^. °' ^ dia 9"ostic 

»- ~-on with ;;: h ;; ::;:: t n s can addi —^ 

the manoganv cene such - e u ■ associated with 

» other disorders resul ting ^.TnT 19 ^""^ ' and 

9 1U lnCrease ° metabolic rates. 

5 -4.1. DIAGNOSIS OP BODY WEIGHT 
D ^SPER_ABNp3?IALlTl£ 



20 and pi;;:;:;: 7^;xt; ot the 

c*..ity. cachexia, and ^ludin 
of subjects having a predisposition to \ ^""^tion 

Such methods may, for ^ e ^ «" 
the mahogany gene nucleotide seLn I rea ^nts such as 
5-1. and antibodies directed bribed in Section 

Eluding peptide fragments ^ne products, 

Section 5.3. Specifically suc T f ' 3Sd — ibed, above, in 
example, for.- * r « a 9«*s may be used, f or 

(1) the detection of the pres 
stations, or the detection *" Ce ° f ^ogany ,ene 

30 expression of ma hogany gene ^1.^^.^ Under ~ 

expression in a wil d - type h „ ° ^ Vels of ^hogany 

type, non-body weight disorder state 
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which correlates with certain body weight disorders c- 
susceptibility toward such body weight disorders; 

(2) the detection of over- or under- abundance c: 
mahogany gene product relative to the abundance of mahco; 
5 gene product m a wild- type non-body weight disorder state 
which correlates with certain body weight disorders or 
susceptibility toward such body weight disorders; and' 

(3) the detection of an aberrant level of mahoqanv -aene 
product activity relative to mahogany gene product -activitv 
io levels in a wild-type, non-body weight disorder state which 
correlates with certain body weight disorders or 
susceptibility toward such body weight disorders. 

Mahogany gene nucleotide sequences can, for examoi^ be 
used to diagnose a body weight disorder using, for example 
the techniques for detecting mutations m the mahoganv aene 
15 aescrabec above in Section 5.1, above. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits comprising 
at least one specific mahogany gene nucleic acid or anti- 
mahogany gene product antibody reagent described herein, 
20 which may be conveniently used, e.g., in clinical settings, 
to screen and diagnose patients exhibiting body weioht 
disorder abnormalities, and to screen those individuals 
exhibiting a predisposition to developing a body weight 
disorder abnormality. 

25 For the detection of mahogany gene mutations, any 

nucleated cell can be used as a starting source for genomic 
nucleic acid. For the detection of mahogany gene expression 
or mahogany gene products, any cell type or tissue in which 
the mahogany gene is expressed may be utilized, such as, for 
example, tissues or cells shown herein to express the MG 

30 gene. 
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Nucleic acid-based detection t--^„ ■ 
below - c a ° n techniques are described 

oeiou ,, _ r. Section ^ 4 - - - . - ' 

de e , , *-epcade detection technics are 

oe - >, " Dec ' oeiow, m Section 5.4.1.2. 

^ utilizing a number of technics. Nuclelc . 
. «uo :ro» any nudeated eel! can be used as the .t-rtin 
10. P-nt for suc h assa y technics, and may be , , . 

C MS " Ma *"* ™^ "* Preparation procedures 
wnicn are «U k „o„n to those of sKin in the a rt 

Genomic DNA may be used in hybridization 

edification assays of biologic.! sample, to detect 

» abn °™lities involving mahogany gene structure including 

point stations, insertions, deletions and ,H lnC1Udln9 
„,„„„ .. UOO! and chromosomal 

rearrangements. Such assays mav include but 

t. southern analyses, single -i^tn^ ~ ^ 

polymorphism analyses (SSCP) , an d PCR analyses. 

Diagnostic methods for the ri^i-^^ ■ ^ 
20 st)er<f . ^ r uile de tectaon of mahogany gene- 

spec, fic mutations can involve for examni « 

incubating nucleic acids ^^^^7^ ".V. 

source „ 7 " Pa " ent ^ " ° th " ~i- ovular 
source „ lth one or more labeled nucleic acid reagents 

inducing recombinant DNfl molecules, doned genes or 

,5 degenerate variants thereof, such as described in Section 

or flawing ^-^^."^^T 
, 0 ^1^" — - « to 3 0 

After incubation, all non- anne aled nudeic acid. 

:r f r the nucieic — — y ^ij ; d are The 

presence of nudeic adds that have hybrid. J Jj'jf 
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molecules exist, is then detected. Using such a detection 
scheme, the nucleic acid from the cell type or tissue of 
interest can be immobilized, for example, to a solid support 
such as a membrane, or a plastic surface such as that on a 
5 microtiter plate or polystyrene beads. In this case, after 
incubation, non- annealed , labeled nucleic acid reagents of 
the type described in Section 5.1 are easily removed. 
Detection of the remaining, annealed, labeled mahogany 
nucleic acid reagents is accomplished using standard 
techniques well-known to those in the art. The mahogany gene 
sequences to which the nucleic acid reagents have annealed 
can be compared to the annealing pattern expected from a 
normal mahogany gene sequence in order to determine whether a 
mahogany gene mutation is present. 

In a preferred embodiment, mahogany gene mutations or 
15 polymorphisms can be detected by using a microassay of 

mahogany nucleic acid sequences immobilized to a substrate or 
"gene chip" (see, e.g. Cronin, et al . , 1996, Human Mutation 
7 ;244-255) . 

Alternative diagnostic methods for the detection of 
20 mahogany gene specific nucleic acid molecules, in patient 

samples or other appropriate cell sources, may involve their 
amplification, e.g., by PCR (the experimental -embodiment set 
forth in Mullis, 1987, U.S. Patent No. 4,^83,2*02), followed 
by the analysis of the amplified molecules using techniques 
well known to those of skill in the art, such as, for 

2 5 

example, those listed above. The resulting amplified 
sequences can be compared to those that would be expected if 
the nucleic acid being amplified contained only normal copies 
of the mahogany gene in order to determine whether a mahogany 
gene mutation exists. 
30 Among those mahogany nucleic acid sequences which are 

preferred for such amplif i cat ion -related diagnostic screening 
analyses are oligonucleotide primers which amplify -mahogany 
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exon sequences. The sequences of such oligonucleotide 
primers are, therefore, preferably derived. fro. manocany 
intron sequences so than the entire exon, or codmc Region ' 
can be analyzed as discussed below. Primer pairs useful for 
5 amplification of mahogany axons are preferably derived from 
accent introns. Appropriate primer pairs can be chosen 
. such that each of the 25 mahogany axons are amolified 
Primers for the amplification of mahogany exons can be 
routinely designed by one of ordinary skill in the art by 
io utilizing the exon and intron sequences of mahoganv shown in 
Mgures, particularly FIGS. 3 and 5. 

Additional mahogany nucleic acid sequences which are 
preferred for such amplification-related analyses are those 
which will detect the presence of a mahogany polymorphism 
wnach doffers from the consensus mahogany sequence deoicted 
in F29UreS ' PartiCUlar1 ^. those «-t detect, the polymorphism 
xoentified in exon 15 (Figure 7) . Such polymorphisms include 
ones which represent mutations associated with bodv weight 
disorders such as obesity, cachexia, or anorexia 

Further, well-known genotyping techniques can be 
20 perrormed to type polymorphisms that are in close oroximity 
to mutations in the mahogany gene itself, includino mutations 
associated with weight disorders such as obesity, cachexia, 
or anorexia. Such polymorphisms can be used to identify 
individuals in families likely to carry mutations in the 
2s manogany gene. If a polymorphism exhibits linkaoe 

disequilibrium with mutations in the mahogany gene the 
polymorphism can also be used to identify individuals in the 
general population who are likely to carry such mutations 
Polymorphisms that can be used in this way include 
restriction fragment length polymorphisms (RFLPs) , which 
involve sequence variations in restriction enzyme target 
sequences, single -base polymorphisms, and simple sequence 
length polymorphisms {SSLPs) . 
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For example, Weber (U.S. Pat. No. 5,075,217) describes a 
DNA marker based on length polymorphisms in blocks of <dC- 
dA)n-(dG-dT)n short tandem repeats! The average separation 
of (dC-dA)n-(dG-dT)n blocks is estimated to be 30,000-60,000 
5 bp. Markers that are so closely spaced -exhibit a high 
frequency co- inheritance , and are extremely useful in the 
identification of genetic mutations, such as, for example, 
mutations within the mahogany gene, and the diagnosis of 
diseases and disorders related to mutations in the mahogany 
gene . 

10 

Also, Caskey et al . (U.S. Pat. No. 5,364,759) describe a 
DNA profiling assay for detecting short tri and tetra 
nucleotide repeat sequences. The process includes extract ino 
the DNA of interest, such as the mahogany gene, amplifying 
the extracted DNA, and labelling the repeat sequences to form 
15 a genotypic map of the individual's DNA. 

A mahogany probe could additionally be used to directly 
identify RFLPs . Further, a mahogany probe or primers derived 
from the mahogany sequence could be used to isolate genomic 
clones such as YACs, BACs, PACs, cosmids, phage, or plasmids. 
20 The DNA contained in these clones can be screened for single- 
base polymorphisms or SSLPs using standard hybridization or 
sequencing procedures. 

The level of mahogany gene expression can also be 
assayed. For example, RNA from a cell type or tissue k^own, 
25 or suspected, to express the mahogany gene, such as muscle, 
brain, kidney, testes, heart, liver, lung, skin, 
hypothalamus, spleen, and adipose tissue may be isolated and 
tested utilizing hybridization or PCR techniques suc-h as are 
described, above. The isolated cells -can be -derived from 
cell culture or from a patient . The analysis of -cells tate .n 
30 from culture may be a necessary step in the assessment of 
cells to be used as part of a cell -based gene therapy 
technique or, alternatively, to t^st the effect of compounds 
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or: the expression of the mahogany gene Such - 
reveaj both quantitative and ouaiiL ana ^*s "ay 

— — n or the ^ ^ ^ 

or ^.ctiv.tion o £ mahogany gene 09 

--..one embodiment of- such a detection W 
moiecuie i £ synthesis * erection scheme, a cDNA 

<..,,. bv reverse "a! ^ ^ " 

for LcL e ^"^J" 9 *• «-» «ed as the 

LieiC acid amDiifirafinn ^ 
10 PC* a m p liflcation reaccion> or - th .\ C ; k t ;° n «•«»». such as a 

reagents used as synthesis initiation ""V •«* 

Primers, in the reverse trans' ^ < *' S " 

-plication steps ofth ^ "* " Uel « lc 

cceps or this method are h. 

mahogany gene nucleic ariH ° Sen frora am °ng the 

The preferred lengths *L ^ — 
15 least 9-30 nucleotides. reagents are at 

^U^trr^" 1 " — — acid 
-ioactive ly l^TH^^T?* " — 
amplified product may be ma<Je J^*™'"**' . 

2 0 visualized by standi ethidium hromid tainLT" 7* * 
-txxaxing any other suitable nucleic acid st " * 

As an .lt.rn.tiv. to ^liUc.^ t. k ^ °" th0d - 

Northern analyses can be „. f tlm "^KJues. standard 

-« expression of -el * 

2S entity of the appelate cenl can be 

Additionally it- ,• «, - • ined - 

~ expression L^."^ ^' J" ^ 
sections (fixed and/or frozen, of " rSC " y Upon "ssue 

from biopsies or resections U c h ^ "7* — 
purification is n.cessarv „ T ™«l.ic acid 
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G.J., 1992 . "PCR jr. Situ Hybridisation: Protocols And 
Applications", Raver. Press, NY; 



5 • 4 • i • 2 . DETECT I O K OF MAHOGANY -KKNE PRODUCTS 
Mahogany gene products, including both wild- type and 
mutant mahogany gene products, conserved variants, and 
polypeptide fragments thereof, which are discussed, above, i 
10 Section - - 2 - ma >' be detected using antibodies which are 
directed against such mahogany gene products. Such 
antibodies, which are discussed i- Section 5.;, below, may 
thereby be used as diagnostics and prognostics for a body* 
weight disorder. Such methods .may be used to detect 
abnormalities - the level of mahogany gene express! or. or of 
15 mahogany gene product synthesis, or abnormalities ir. the 
structure, temporal expression, and/or physical location of 
mahogany gene product. The antibodies and immunoassay 
methods described herein have, for example, important in 
vitro applications in assessing the efficacy of treatments 
20 for body weight disorders such as obesity, cachexia, and 
anorexia. Antibodies, or fragments of antibodies, such 'as 
those described below, may be used to screen potentially 
therapeutic compounds in vitro to determine their effects on 
mahogany gene expression and mahogany gene product 
25 production. The compounds that have beneficial effects on 
body weight disorders, such as obesity, cachexia, and 
anorexia, can thereby be identified, and a therapeutically 
effective dose determined. 

In vitro immunoassays may also be used, for example, to 
assess the efficacy of cell -based gene therapy for a body 
weight disorders, including obesity, cachexia, and anorexia. 
Antibodies directed against mahogany ge-ne products -may fee 
used in vitro to determine, for example, -the level of 
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ene .. B " : expressi °" achieved in cells genetically 

enc.neerec to produce mahogany gene Droduct In 

inrrace' no - ' " in cne case of 

— LiCi moftogany gene products , u , h «n 
done, ^f^,!,, . . such °" assessment is 



preferably, using cell lysates or extracts" 



, 5 n S -, , — extracts. Such 

v n vivo -= ■- ^ C ° acmeve therapeutic efficacy 

-n ..vc, „ E «u as optimization of the -oene r„,= 

Brotoco: 9 "Placement 



The t issue or cell type to be analyzed will ' „ 

10 include those that are ta own, or susoecteT 2 " 

mahogany gene. The protein .solar, th<? 

~y. for example, be such as t^ ^ h '~ ln 

-ne ,„„, antibodies r L aboT """^ ""^ 

Harbor Laboratory Pr ' C L s ° V """^ " ' ^ ^ 

Elated cells can b! deriC d f^T ^ ' "» 

» P«ient. The analysis 0^11 tl^f ~ ' ' ~» « 

ceiis taken from culture m»M> k« 
necessary ch Pn _ u ^ Lure ma Y be a 

o f a cei^arL^:;— ni:ri: ^ - - 

20 mah Pr6£erred di ^-«c methods for the detection of 
mahogany gene products, conserved variants or peptide 
fragments thereof, may involve, for examole 
wherein n. „ h example, immunoassays 

wherein the mahogany gene products or conserve , ■ 
Peptide fragments are detected by thei!" " " 

25 anti-mahogany gene product -specif ic Z^"^ " 
For example, antibodies or- ■f>-= 
such as those described, ab ve i 

«c ^antitative^ or g ualitatively^ dete "he ^ "* 
mahogany gene products or conserved PreSe " Ce ° f 

fragments thereof This dnT " Pep " de 

30 i«,unofluorescence techni^^™^' » 
labeled antibody < see bell, thi^sTct , ^""^ 
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Such techniques are especially preferred for mahogany gene 
products that are expressed on the cell surface. 

The antibodies (or fragments thereof) useful in the 
present invention may, additionally, be employed 
5 histologically, as in immunofluorescence or immunoelectron 
microscopy, for in situ detection of mahogany gene oroducts. 
conserved variants or peptide fragments thereof. m situ 
detection may be accomplished by removing a histological 
specimen from a patient, and applying thereto a labeled 
o antibody that binds to a mahogany polypeptide. The antibody 
(or fragment) is preferably applied by overlaying the labeled 
antibody (or fragment) onto a biological sample. Through the 
use of such a procedure, it is possible to determine not only 
the presence of the mahogany gene product, conserved variants 
^ or peptide fragments, but also its distribution in the 
> examined tissue. Using the present invention, those -of 
ordinary skill will readily recognize that any of a wide 
variety of histological methods (such as staining procedures) 
can be modified in order to achieve in situ detection of a 
mahogany gene product . 

Immunoassays for mahogany gene products, conserved 
variants, or peptide fragments thereof will typically 
comprise: (1) incubating a sample, such as a biological 
fluid, a tissue extract, freshly harvested cells, or lysates 
of cells in the presence of a detectably labeled antibody 
capable of identifying mahogany gene products, conserved 
variants or peptide fragments thereof; and <2) detecting the 
bound antibody by any of a number of techniques well-known in 
the art . 

The biological sample may be brought in -contact with and 
immobilized onto a solid phase support or carrier, such as 
nitrocellulose, that is capable of immobilizing cells, cell 
particles or soluble proteins. The support may then be 
washed with suitable buffers followed by treatment with -the 
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with appropriate substrate., preferably a cnrcmoce-: 



-:er v 



substrate, •:. such a manner as :c produce a chemical 
that can be detected, for example, by spectrophotometry, 
fiuor:metr:c. or by visual means. Enzymes that can be usee t 
£ detectable- label the antibody include, but are not l;~:tec 
to, maiate. dehydrogenase, staphylococcal nuclease, delta-5- 
steroid isomerase, yeast alcohol dehydrogenase, o- 
giyceropnosphate, dehydrogenase., tr.ose phosphate ■ isomerase . 
horseradisn peroxidase, alkaline phosphatase, asparaginase,' 
Q glucose oxidase, t-geiactosidase . ribonuci ease , urease, 

cataiase, gl ucose- 6 - phosphate dehydrogenase, giucoamviase anc 
acetylcholinesterase. The detection can be accomplished bv 
coionmetric methods that employ a chromogenic substrate for 
the enzyme. Detection may also be accomplished by visual 
comparison of the extent of enzymatic reaction of 'a substrate 
" :n comparison with similarly prepared standards. 

Detection may also be accomplished using any of a 
variety of other immunoassays. For example, by radioactively 
labeling the antibodies or antibody fragments, it is possible 
to detect mahogany gene products through the use of -a 
radioimmunoassay (RIA) (see, for example, Weintraub, B . 
Principles of Radioimmunoassays, Seventh Training Course on 
Radioligand Assay Techniques, The Endocrine Society, March, 
1986) . The radioactive isotope can be detected by such means 
as the use of a gamma counter or a scintillation counter or 
by autoradiography. 

It is also possible to label the antibody with a 
fluorescent compound. When the f luorescently labeled 
antibody is exposed to light of the proper wave length, it* 
presence can then be detected due to fluorescence. Among the 
most commonly used fluorescent labeling oompounds are 
fluorescein isothiocyanate , rhodamine, phycoerythrin, 
phycocyanin, allophycocyanin, o-phthaldehyde and 
f luorescamine . 
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that bine to mahogany gene regulatory sequences {-e.g., 
promoter sequences; see e.g., Piatt, 1994, J. Biol. 'chem. 
269, 2E5SE-26562) , which is incorporated herein by reference 
in its entirety, and that can modulate the level of mahogany 
5 gene expression. Such compounds may include, but are not 
limited tc, small organic molecules, such as ones that are 
able to cross the blood-brain barrier, gain to and/or entry 
into an appropriate cell and affect expression of the 
mahogany gene or some other gene involved in the body weight 
io regulatory pathway, or intracellular proteins. 

Methods for the identification of such proteins are 
described, below, in Section 5.4.2.2. Such proteins may be 
involved in the control and/or regulation of body weight. 
Further, among these compounds are -compounds that affect -the 
level of mahogany gene expression and/or mahogany gene 
15 product activity and that can be used in the therapeutic 
treatment of body weight disorders, including obesi-ty, 
cachexia, and anorexia, as described, below, in Section 5.9. 

Compounds may include, but are not limited to, peptides 
such as, for example, soluble peptides, including but not 
20 limited to, Ig-tailed fusion peptides, and members of random 
peptide libraries; (see, e.g., Lam, et al . , 1991, Nature 3S4 , 
82-84; Houghten, et a J . , 1991, Nature 354, B4-8€), and 
combinatorial chemistry-derived molecular library made of D- 
and/or L- configuration amino acids, phosphopep tides 
25 (including, but not limited to members of -random or parti-ally 
degenerate, directed phosphopept ide libraries; see, e.g., 
Songyang, et al . , 1993, Cell 72, 767-778), antibodies 
(including, but not limited to, polyclonal, monoclonal, 
humanized, anti-idiotypic, chimeric or single chain 
antibodies, and FAb, F(ab') 2 and FAb expression library 
fragments, and epitope-binding fragments thereof), and small 
organic or inorganic molecules. 
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and the test compound, which is not anchored, may he labeled, 
either directly cr indirectly. 

It. practice, microtiter plates are conveniently 
utilized as the solid support. The anchored component may be 
5 immobilized by non-covalent or covaient attachments. Non- 
covaien: attachment may be accomplished by simply .coating the 
solid surface with a solution of the protein and drying. 
Alternatively, an immobilized antibody, preferably a 
monoclonal antibody, specific for the protein to be 
iQ immobilized may be used to anchor the protein to the solid 
surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the non- immobilized.' 
component as added to the coated surface containing the 
anchored component. After the reaction is complete, 
unreected components are removed (e.g., by washing) under 
15 conditions such that any complexes formed will remain 

immobilized on the solid surface. The detection of complexes 
anchored on the solid surface can be accomplished in a number 
of ways. Where the previously non- immobilized component is 
pre- labeled, the detection of label immobilized on the 
20 surface indicates that complexes were formed. where the 
previously non- immobilized component is not pre -labeled, an 
indirect label can be used to detect complexes anchored on 
the surface; e.g., using a labeled antibody specific for the 
previously non- immobilized component (the antibody, i-n turn, 
2s may be directly labeled or indirectly labeled with a labeled 
anti-Ig antibody). 

Alternatively, a reaction can be conducted in a liquid 
phase, the reaction products separated from unreacted 
components, and complexes detected; e.g., using an 
immobilized antibody specific for mahogany gene product or 
30 the test compound to anchor any complexes formed in solution, 
and a labeled antibody specific for the other component of 
the possible complex to detect anchored complexes . 
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labeled mahogany pens- product., using mahogany gene crosuc; ir. 
a manner similar tc. the well known technique c: antibody 
probing c: Agtil j ibraraes . 

One method that detects protein interactions ;r. vivc 
c the two-hybrid system, is described m detail for 

illustration only anc not by way of limitation. One version 
of this system has been described (Chi en, et al . , 19B2, Prcc . 
Natl. Acad. Sci . USA, BE, 957E-S5S2) and is commerci al 1 y 
available from Clontech (Pale Alto, CA) 
^ Eriefly, utilising such a system, plasmids. are 

constructed that encode two hybrid proteins: one consists of 
the DNA- binding domain of a transcription activator protein 
fused to the mahogany gene product and tne other consists of 
the transcription activator protein's activation domain fused 
to an unknown protein that is encoded by a cDNA that has been 
recombined into this piasmid as part of a cDNA library. The 
DNA-bindang domain fusion piasmid and the cDNA library are 
transformed into a strain of the yeast Saccharomyces 
cerevisiae that contains a reporter gene (e.g., HE'S or lacZ) 
whose regulatory region contains the transcription 
activator's binding site. Either hybrid protein alone cannot 
activate transcription of the reporter gene: the DNA-bi-nding 
domain hybrid cannot because it does not provide activation 
function and the activation domain hybrid cannot because it 
cannot localize to the activator's binding sites. 
Interaction of the two hybrid proteins reconstitutes the 
functional activator protein and results in expression of the 
reporter gene, which is detected by an assay for the reporter 
gene product . 

The two-hybrid system or related methodologies may be 
used to screen activation domain libraries for proteins -that 
interact with the "bait" gene product. By way of example, 
and not by way of limitation, mahogany gene products may be 
used as the bait gene product. Total genomic or cDNA 

-55- 



0006373A2.L> 



WO 00/0537? 

PCT/US99/16484 

seoue.nce, are fused to the DNA encoding an 

This 2ibra-v a n ri - . «°-ng on activation domain 

= -> -no a plasmid encoding a hybrid of - H ■ 
mahogany ?e „e product fused £o °* " b " £ 

— - —porter .Z^^r" 
5 cr.n.tor»„„ are screened for those that . "-""9 
reporter gene. Po r exampie a b -T h *** 
such as tne open reading'fram "~ 
cloned into a vector such that i* raaho 9 a ^ 9»ne. can oe 

These colonies are purified and the library , """^ 
responsible for reporter gene exprls ol Ire ^ 
seguencmc is then used to identify t e prote 
the library plasmid.. . Proteins encoded by 

A cDNA library of the ceil n / 
interact with bait mahogany gene ^ ■ 

» can be made using methods rout! e^II JIL" 

According to the particular system" ™ th « »« . 

-ample, the c DN A fragments ca„ 11 III™ T^' 
■-ch that they are translationaliy fu d ^ ^ ' 
transcriptional activation domain of GflL < ! , 
20 can be co- transformed along with °[ ^ ' * "*"ry 

fusion plasmid into a yeast ■ " V ">^°9any gene-GAL4 

-iven by a promoter th! on ta s" GAM """^ 8 ^ 
A cDNA encoded protein, fused to I 

activation domain that interact T tran «riPtional 

2S Product „iu reconstitute In a ,::^"^ 9 ^ ~ 
drive expression of the HIS3 gene 
HIS3 can be detected by their grit h 

containing semi-solid aga r basl °" dl * h " 

- c DN A can then be ^t^^^ 

:::i::: ; " 5 protein — * practlced in 



56 - 



BNSDOCID. <WO O0OS373A2 I > 



WO 00/0537? PCT/US99/1 6484 



5.4.2.2. ASSAYS FOR COMPOUNDS THAT INTERFERE 
WITH MAHOGANY GENE PRODUCT 
MACROMOLECULE INTERACTION 

The mahogany gene products may, ±n vivo, interact with 
5 one or more macromoiecules , such as proteins. For example, 
the mahogany gene products may, in vivo, interact with the 
mahoganoid gene products. Other macromoiecules which 
interact with the mahogany gene products may include, but: are 
not limited to, nucleic acid molecules and those proteins 
10 identified via methods such as those described, above, in 

Sections 5.4.2.1 - 5.4.2.2. For purposes of this discussion, 
the macromoiecules are referred to herein as "binding 
partners". Compounds that disrupt mahogany gene product 
binding to a binding partner may be useful in regulating the 
activity of the mahogany gene product, especially mutant 
mahogany gene products. Such compounds may include, but are 
not limited to molecules such as peptides, and the like, as 
described, for example, in Section 5.4.2.1 above. 

The basic principle of an assay system used to identify 
compounds that interfere with the interaction between the 
2 0 mahogany gene product and a binding partner or partners 

involves preparing a reaction mixture containing the -mahogany 
gene product and the binding partner under -conditions and for 
a time sufficient to allow the two to interact and bind, thus 
forming a complex. In order to test a compound for 
25 inhibitory activity, the reaction mixture is prepared in the 
presence and absence of the test compound. The test compound 
may be initially included in the reaction mixture, or way be 
added at a time subsequent to the addition of mahogany gene 
product and its binding partner. Control reaction mixtures 
are incubated without the test compound or with a -compound 
which is known not to block complex formation. The formation 
of any complexes between the mahogany gene product and the 
binding partner is then detected. The formation of a complex 

- 57 - 



BNSDOCtO: <WO 0005373A2_L> 



WO 00/05373 



PCT/US99/16484 



15 



m the ccr.crcl reactio- n--* 

^ — - tne reaction r.;x:ure 

con.t a i ni-nc t ne t-e-s t compound - n ~ - , ^ 0 - w - * ^ 

- ^ , ^c.^es tnat trie comoounc 

interferes witr. the lnte^ac — r -' ^ c 

— °- manoganv cent r~c~u^ 
and the rmdmc partner. Acc:t^r-.-. v I 

^ -> ' »-wui^^ ca i crma t ~ 

5 within reaction mixtures containing the test compound a-' 
normaJ mahogany gene product may aiso be compared tc ccmr.e:- 

formation within reaction m-y-^cc ■ ■ ■ 

cav^c^u.. m *>--re$ containing the test 

compound and a mutant mahocanv oe « — oduct " tv,- 

- .--o^uct. This comparisor 

may De important in those cases wherein it is des-a*-.! ^ 

20 id€ntify COn,p ° Und£ ^at disrupt interactions of muting but 
not norma, mahogany gene product 

The assay for compounds that interfere with the 

interaction or the mahoqanv aen* n— a— - K , -. 

. y^-c.E cnc binainc 

partners can C e conducted ma heterogeneous or homogeneous 
format. Heterogeneous assays involve anchoring eithe- the 
mahogany gene product or the binding partner onto a solid 
support and detecting complexes formed on the solid support 
at the eno of the reaction. I n homogeneous assays the* 
entire reaction is carried out in a li qu id phase. ' In either 
approach, the order of addition of reactants can be varied to 
2D obtain different information about the compounds beino 

tested..;. For example, test compounds that interfere with the 
interaction between the mahogany gene products and the 
binding partners, e.g., by competition, can be identified by 
conducting the reaction in the presence of the test 
>s substance; i.e., by adding the test substance to the reaction 
mixture prior to or simultaneously with the mahogany gene 
product and interactive intracellular binding partner 
Alternatively, test compounds that disrupt preformed 
complexes, e.g.. compound, with higher binding constants that 
displace one of the components from the complex, can be 
tested by adding the test compound to the reaction mixture 
after complexes have been formed. The various formats are 
described briefly below. 
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Ir. a net eroceneouf assay system, either the mahoaanv 
oene oroduct cr the i nt e^a^t - ve bmd-:T-~ . ^ . „ ^. ■ 

onto a solid surface .. while the non -anchorec species is 
labeled, either directly cr indirectly. Ir. cract-ce 
5 nucrotiter plates are conveniently utilized. The anchored 
species may be immobilized by non-covaient cr ccvalent 
attachments. Non- covai en: attachment may be accomplished 
simply by coating the solid surface with a solution of the 
mahogany gene product or binding partner and crvinc . 
10 A ' tern£t - vei y , an immobilized antibody specific for the 

species to be anchored may be used tc anchor the species to 
the solid surface. The surfaces may be prepared in -advance 
and stored . 

In order tc conduct the assay, the partner of the 
immobilized species is exposed to the coated surface with or 
15 without the test compound. After the reaction is complete, 
unreacted components are removed (e.g., by washing) and any 
complexes formed will remain immobilized on the solid 
surface. The detection of complexes anchored on the solid 
surface can be accomplished in a number of ways. Where the 
20 non- immobilized species is pre-labeled, the detection of 
label immobilized on the surface indicates that .complexes 
were formed. Where the non- immobilized species is not pre- 
labeled, an indirect label can be used to detect complexes 
anchored on the surface; e.g., using a labeled antibody 
25 specific for the initially non- immobilized species -(the 
antibody, in turn, may be directly labeled .or indirectly 
labeled with a labeled anti-lg antibody). Depending upon the 
order of addition of reaction components, test compounds t-hat 
inhibit complex formation or that disrupt preformed -complexes 
can be detected. 

^ ft 

Alternatively, the reaction can be conducted in a li-quid 
phase in the presence or absence of the test compound, the 
reaction products separated from unrea-cted components, and 
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20 In another embodiment of the invention th.. 
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to the region of the protein involved in interactive binding 
Alternatively, one protein can be anchored to a solid surfac 
using methods described in this Section above, and allowed t 
interact with and bind to its labeled binding partner, which 
; has been treated with a proteolytic enzyme, such as trypsin. 
After washing, a short, labeled peptide comprising the 
binding domain may remain associated with the solid material 
which can be isolated and identified by amino acid 
sequencing. Also, once the gene coding for -the segments is 
engineered to express peptide fragments of the protein, it 
can then be tested for binding activity and purified or 
synthesized . 

For example, and not by way of limitation, a mahogany 
gene product can be anchored to a solid material as 
described, above, in this Section by making a GST-1 fusion 
protein and allowing it to bind to glutathione agarose beads. 
The binding partner can be labeled with a radioactive 
isotope, such as 3S S, and cleaved with a proteolytic enzyme 
such as trypsin. Cleavage products can then he added to the 
anchored GST-1 fusion protein and allowed to bind. After 
washing away unbound peptides, labeled bound material, 
representing the binding partner binding domain, can be 
eluted, purified, and analyzed for amino acid sequence by 
well-known methods. Peptides so identified can be produced 
synthetically or produced using recombinant DNA technology. 

5.4.2.4. ASSAYS FOR THE IDENTIFICATION OF 
COMPOUNDS THAT AMELIORATE BODY 
WEIGHT DISORDERS 

Compounds, including but not limited to .bindi-ng 
compounds identified via assay techniques such as those 
described, above, in Sections 5.4.2.1 - S.4.2.3, can be 
tested for the ability to ameliorate body weight -disorder 
symptoms, including obesity, cachexia, and anorexia. It 
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containing a human or altered form of mahogany gene, may be 
used to identify compounds capable of ameliorating symptoms 
of the disorder. Such animal models may be used as test 
substrates for the identification of drugs, pharmaceuticals, 
5 therapies and interventions. For example, animal models may 
be exposed to a compound suspected of exhibiting an ability 
to ameliorate, symptoms, at a sufficient concentration and for 
a sufficient time to elicit such an amelioration of body 
weight disorder symptoms. The response of the animals to the 
exposure may be monitored by assessing the reversal of the 
symptoms of the disorder. 

With regard to intervention, any treatments that reverse 
any aspect of body weight disorder- like symptoms should be 
considered as candidates for human therapeutic intervention 
in such a disorder. Dosages of test agents may be determined 
15 by deriving dose- response curves, as discussed in Section 
5.5.1, below. 

5.4.3. COMPOUNDS AND METHODS FOR THE TREATMENT 
OF BODY WEI45HT DISORDERS 

2Q Described below are methods and compositions whereby 

body weight disorders, including obesity, cachexia, and 
anorexia, may be treated. Such methods -can comprise, for 
example administering compounds which modulate the expression 
of a mammalian mahogany gene and/or the synthesis or activity 
of a mammalian mahogany gene product, so that symptoms of the 

25 body weight disorder are ameliorated. Alternatively, in 
those instances whereby the mammalian body weight disorder 
results from mahogany gene mutations, such methods can 
comprise supplying the mammal with a nucleic acid molecule 
encoding an unimpaired mahogany -gene product such that an 

30 unimpaired mahogany gene product is expressed and symptoms of 
the disorder are ameliorated. 
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the activity, expression or synthesis of the mahogany gene, 
including the ability to ameliorate the symptoms of a 
mammalian body weight disorder, are antisense, rifoozyme, and 
triple helix molecules. Such molecules may be -designed to 
5 reduce or inhibit either unimpaired, or if appropriate, 

mutant target gene, activity. Techniques for the* product ion 
and use of such molecules are well known to those of skill in 
the art . 

Ant i sense RNA and DNA molecules act to directly block 
iQ the translation of mRNA by hybridizing to targeted mRNA and 
preventing protein translation. Ant i sense approaches involve 
the design of oligonucleotides that are complementary to a 
target gene mRNA. The antisense oligonucleotides will bind 
to the complementary target gene mRNA transcripts and prevent 
translation. Absolute complementary , although preferred, 
15 is not required. 

A sequence "complementary" to a portion of an RNA, as 
referred to herein, means a sequence having sufficient 
complementary to be able to hybridize with the RNA, forming 
a stable duplex; in the case of double- stranded antisense 
20 nucleic acids, a single strand of the duplex DNA may thus be 
tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complement arily 
and the length of the antisense nucleic acid. Generally, the 
longer the hybridizing nucleic acid, the more base mismatches 
25 with an RNA it may contain and still form a stable duplex (or 
triplex, as the case may be) . One skilled in the art can 
ascertain a tolerable degree of mismatch by use of standard 
procedures to determine the melting point of the nybridized 
complex. 

In one embodiment, oligonucleotides complementary to 
30 non-coding regions of the mahogany gene could be used i-n an 
■ antisense approach to inhibit translation of endogenous 
mahogany mRNA. Antisense nucleic acids should be at least 
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the blood-brain barrier (see, e.g., PCT Publication -No. 
WO89/10134, published April 25, 1988), hybrid! zat ion- 
triggered cleavage agents (see, e.g., Krol ec al . , 1988, 
EioTechnigues 6, 958-S76) or intercalating agents (see, e.g. , 
5 Zon, 1988, Pharm. Res. 5, 539-549). To this end, the 

oligonucleotide may be conjugated to another molecule, -e.g., 
a peptide, hybridization triggered cross -linking agent, 
transport agent, hybridization- triggered cleavage agent, etc. 
The antisense oligonucleotide may comprise at least one 
iQ modified base moiety which is selected from the group 

including but not limited to 5-f luorouracil , 5 -bromouracil , 
5-chlorouracil, 5- iodcuracil . hypoxanthine , xanthine, 

4- acetylcytosine, 5 - (carboxyhydroxyl methyl ) uracil, 
5 - carboxyme thyl aminome t hy 1 - 2 - 1 hiouridine , 

5- carboxymethylaminomethyluracil , dihydrouracil , beta-D- 
15 galactosylqueosine, inosine, N6- isopenteny lade nine .. 

1- me thyl guanine, 1 -methyl inosine , 2 , 2 -dimethylguanine , 

2- methyladenine, 2 -methyl guanine, 3 -methylcytosine , 
5- methyl cytosine, N6-adenine, 7-methylguanine , 

5-methylaminomethyluracil , 5 -methoxyaminomethyl -2 - thiouracil , 
20 beta-D-mannosylqueosine, 5 '-methoxycarboxymethyluracii , 
5-methoxyuracil, 2 -methyl thio-N6 - isopentenyl adenine , 
uracil -5-oxyacetic acid (v) , wybutoxosine , pseudouracil , 
gueosine, 2-thiocytosine , 5 -methyl -2- thiouracil , 
2 -thiouracil, 4 -thiouracil , 5-methyluracil , uracil - 
25 5-oxyacetic acid methylester, uracil -5-oxyacetic aoid fv) , 
5-methyl-2-thiouracil, 3- (3-amino-3-N-2-.carooxypropyl) 
uracil, (acp3)w, and 2 , 6 -diaminopurine . 

The antisense oligonucleotide may also -comprise at least 
one modified sugar moiety selected from the group including 
but not limited to arabinose, 2-f luoroarabinose , xylulose, 
30 and hexose . 

In yet another embodiment, the anti-sense oligonucleotide 
comprises at least one modified phosphate -backbone selected 
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c - - ana c formarpr^i =- = ■> 
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• — oligonucleotide. An a - anomeric 

oiigonucieotide forms specific double-str-nd^ v, • - 
complementer rna in which stronaed hyenas with 

thp . WniCh ' c °ntrary to the usual 3-units 

the strancs run narali^i - units, 
198 7 m - Unparallel toeacn other (Gautier, et al 
1987, Nuc. Acids Res. is. 6625-6641) ' The oi- 
ls a 2'-o m^hv! -w The oli 9onucleotide 

0 methylrabonucleotide (Inoue, et aj 1987 M - 
Acids Res. 15 613 i fil - D > ' 1987 ' Nu ci . 

'Inoue et ' or a chimeric RNA-DNA analogue 

Oligonucleotides of 

fro, B.osearch Apnlied „■ commercially, available 

j-i-u, Applied Biosystems *>n- \ * 
phosohorothioate oliann i As examples, 

20 controlled core a i SC c - Prepared by use of 

aeo. pore glass polymer supoorts (Sarin ^ , ' 

Proc. Natl. Acad. Sci USA « , ' &1 " ' 198£ < 

•=><-!- U.S.A. 85, 7448-7451) etc 

Whale antisense nucleotides complementary to'tv 

9 ene coding region se q uence could he used Z s ^ 

complementary to the transcribed, ung ate egion a 

25 most preferred. region are 

Antisense molecules should be delivered to . n . 
express the target oene i„ „• ellve ^d to cells that 

been developed Tor Ll A ° f Mbod * h * Ve 

antisete ;r:i: :r isense ™ ~ — 
30 — — .« — a ::: s :: s rri:: zi:: 7 in : the 

target the desired ceils ( e a a ^ • ^signed to 
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expressed on zhe targe: ceil surface, can be £cm:n:£ierec 
system: cai 2 y . 

However, :i :s oiter. difficult to achieve intracellular 
concentrations c: the am i sense sufficient tc suppress 
t translation c: endogenous mRNAs . Therefore a preferred 
approach u::::zes a reconacman: DNA construe: in v;m cn the 
antisense cu i gonuci eot i de : £ placed under the control of a 
strong pol ZZZ or pel -I Z promoter . The use o: such a 
construct tc transfect target cells in the patient will 
result m the t ranscriot i on o: sufficient amounts of single 

0 

stranded RNAs that wu^ rorn complementary base pairs w:th 
the endogenous target gene transcripts and thereby prevent 
translation c: the targe: gene mRNA . For example, a vector 
can be introduced e.c., such that it is taken up by a ceil 
and directs the transcript : c :\ c: an antisense RNA . Such a 

- vector can remain episomal or become chromosomal ly 

integrated, as long as it can be transcribed tc- produce the 
desired antisense RNA.. Such vectors can be constructed by 
recombinant DNA technology methods standard in the art . 
Vectors can be plasmid, viral, or others known in the art, 

q used for replication and expression in mammalian cells. 

Expression of the sequence encoding the antisense RNA can be 
by any promoter known in the art to act in mammalian, 
preferably human cells. Such promoters can be inducible or 
constitutive. Such promoters include but are not limited to: 
the SV40 early promoter region (Bernoist and ChaTnbon, 1981, 

5 

Nature 290, 304-310), the promoter contained in the 3' long 
terminal repeat of Rous sarcoma virus (Yamamoto, et al., 
1980, Cell 22, 787-797), the herpes thymidine kinase promoter 
(Wagner, et al . , 1981, Proc . Natl. Acad. Sci . U.S.A. 78, 
1441-1445), the regulatory sequences of the metal loth ione in 
0 gene (Brinster, et al., 1962, Nature 296, 35-42), -etc. Any 
type of plasmid, cosmid, YAC or viral vector can i>e used to 
prepare the recombinant DNA construct which can <be introduced 



WO 00/05373 

PCT/US99/16484 

Erectly in tc the tissue site Alte-n-r ■ n 

can ce usee ->-. = ,- -^i • - y ' viro1 vectors 
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can also jDe uqph ~ 
translation o* tar 0 - t n ° P rev ent 

c target gene mRNA and, therefore «, 
target gene product. (S ee, e.g PCT ' ' eXpreSSion °* " 

Pllhl ^ _ PCT International 

Publication W0*0/ ai364 , published October , • 1990 . Sarv 
10 1990, Science 24?j 1222 . 1225) _ 199 °' Sa ^r, et 

Ribozymes are enzymatic RNA molecules capable * 
catalyzinc the « M Hf,> , capable of 

tne specific cleavage of rna (Fot- a 
Rossi, 19M . Cu ,rent Biolo 9y 4 . * ™' — 

rlbOSy ~ «"» '-civ., seance specific hvbr " " 

the ribo 2y me moiecu.e to compWntLy ^.T^T " 
5 by an endonucleolytic cleavaae followed 
ribozy*: mdecuies must indude ITo. 2l ^ ^ °' 
complementary to the target 9 ene Se "uences 
wen known catalytic « ' ^ mUSt include 

- thi S .^r ..rrr n po r bie for ^ 

,0 which is incorporated herefh'b t £ ltlZ , 
While ri b o 2y mes that c le av e ^ t ". «»""'V- 
reception sequences can be used Vo - SPeC1 " C 
T s the use of hammerhead ^Z^Z^Z ~ - 

25 TZZtZ^JT":-" " ------ted by: 

25 y ons tnat form complementary x»» ■ 

tar 9 et The S o!e r^ireLnt " tha t t h 7" 

bave the t o lloMn9 Mguence of _ ^ , , ; 

construction and production of h«»m. • - 5 u °- 3 - The 
Known in the art and i. „ hammernead ri-bozymes is well 

■ art and 1E described more fullv in ' m 
Molecular B iolo Sy an d Biotecnnolo 3y • A ^1 h "T"' 1SS5 ' 
Terence. VCH Publishers. N e„ yo ^' * C °"~— ^ / 

«' - - Haselof f anH en cT . 

lyee, Nature, 334, 
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585-5?: ( which is incorporated herein by reference in its 
entirety . 

Preferably the ribozyme is engineered so that the 
cleavage recognition site is located near the 5' end of the 
5 target gene mRNA, i.e., to increase efficiency and minimize 
the intracellular accumulation of non- functional mRNA 
transcripts . 

The ribozymes of the present invention also include RNA 
endoribonucleases (hereinafter "Cech- type ribozymes") such as 
the one that occurs naturally in Tetrahymena thermophila 
(known as the IVS, or L-19 IVS' RNA) and that has been 
extensively described by Thomas Cech and collaborators (Zaug, 
et aJ. f 1984, Science, 224, 574-578; Zaug and Cech, 1986, 
Science, 231, 470-475; Zaug, et al . , 1986, Nature, 324, 429- 
433; published International patent application No. WO 
5 88/04300 by University Patents Inc.; Been and Cech, .1986, 
Cell, 47, 207-216). The Cech-type ribozymes have an eight 
base pair active site which hybridizes to a target RNA 
sequence whereafter cleavage of the target >RNA takes place. 
The invention encompasses those Cech-type ribozymes which 
0 target eight base-pair active site sequences that are present 
in the target gene. 

As in the antisense approach, the ribozymes can be 
composed of modified oligonucleotides (e.g., for improved 
stability, targeting, etc.) and should be delivered to cells 
g that express the target gene in vivo. A preferred method -of 
delivery involves using a DNA construct "encoding" the 
ribozyme under the control of a strong constitutive pel III 
or pol II promoter, so that transfected cells will produce 
sufficient quantities of the ribozyme to destroy endogenous 
target gene messages and inhibit translation. Because 
ribozymes unlike antisense molecules, are oatalytio, .a lower 
intracellular concentration is required for efficiency. 
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single stranded and composed of deoxynucleotides . The base 
composition of these oligonucleotides must be designed to 
promote triple helix formation via Hoogsteen base pairing 
rules, which generally require sizeable stretches of either 
5 purines or pyrimidines to be present -on one strand of a 

duplex. Nucleotide sequences may be pyrimidine - based , which 
will result in TAT and CGC triplets across the three 
associated strands of the resulting triple helix. The 
pyrimidine-rich molecules provide base complementary ly to a 
purine-rich region of a single strand of the duplex in a 
parallel orientation to that strand. In addition, nucleic 
acid molecules may be chosen that are purine-rich, for 
example, contain a stretch of G residues. these molecules 
will form a triple helix with a DNA duplex that is rich in GC 
pairs, in which the majority of the purine residues are 
15 located on a single strand of the targeted duplex, resulting 
in GGC triplets across the three strands in the triplex. 

Alternatively, the potential sequences that can be 
targeted for triple helix formation may be increased by 
creating a so called "switchback" nucleic acid molecule. 
20 Switchback molecules are synthesized in an alternating S'-3', 
3 '-5' manner, such that they base pair with first -one strand 
of a duplex and then the other, eliminating the necessity for 
a sizeable stretch of either purines or pyrimidines to be 
present on one strand of a duplex. 
25 In instances wherein the antisense, -ribozyme, and/or 

triple helix molecules described herein are utilized to 
inhibit mutant gene expression, it is possible that the 
technique may so efficiently reduce or inhibit the 
transcription (triple helix) and/or translation (anti-sense, 
ribozyme) of mRNA produced by normal -target gene alleles that 
30 the possibility may arise wherein the concentration of -normal 
target gene product present may be lower than is -necessary 
for a normal phenotype. In such cases, -to ensure that 
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substantially normal levels ^ 
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' nucieic ac-id molecules -h^ 
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- , . target gene activity 

An.i -sense RNA and DNA, ribozyme and t ■ - 
-iecuies of the invention mav be prepared b ^ 
known in the art for th« I Prepared by any method 

as discussed above. ^J^^ °' ^ "* ^ n ° le « 1 «. 
» synthesizing oli.odeo^^ ^ — ly . 

^^ribonucleotides well known in the ' 
example solid phase phosphor amidit. ! " £ ° r 

Alternatively. ^ molecLTL b ^ • 
*■ inscription of £ " ^ ^™ ^ * — and . 

20 RNA molecule. Such DNA sequences may b ^ """We- 

wide variety of vectors th!t incorporated into a 

— — such th i: ™ 

promoters. Alternatively an t-H °' SP6 Polymerase 

!5 depending on the promoter used caTb lnduclb1 ^ 

cell lines. . ' C3n be introduced stably into 

Mahogany gene nucleic acid ' 
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norma j mahogany gene or a portion of the mahogany -gene that 
directs the production of a mahogany gene product exhibiting 
normal mahogany gene function, may be inserted into the 
appropriate cells within a patient, using vectors that 
5 include, but are not limited to adenovirus, adeno-associated 
virus, anc retrovirus vectors, in addition to other particles 
that introduce DNA into cells, such as liposomes. 

Because the mahogany gene is expressed in the brain, 
such gene replacement therapy techniques should be capable 
^ delivering mahogany gene sequences to these cell types within 
patients. Thus, in one embodiment, techniques that are well 
known to those of skill, in the art (see, e.g., POT 
Publication No. WO89/10134, published April 25, 1S&8) can be 
used to enable mahogany gene sequences to cross the blood- 
brain barrier readily and to deliver the sequences to cells 
15 in the brain. With respect to delivery that is capable of 
crossing the blood-brain barrier, viral vectors such as, for 
example, those described above, are preferable . 

In another embodiment, techniques for delivery involve 
direct administration of such mahogany -gene sequences to the 
20 site of the cells in which the mahogany gene sequences are to 
be expressed. 

Additional methods that may be utilized to increase the 
overall level of mahogany gene expression and/or mahogany 
gene product activity include using target homologous 
recombination methods, discussed in Section S.2 # above, to 
modify the expression characteristic of an .endogenous 
mahogany gene in a cell or microorganism -by inserting a 
heterologous DNA regulatory element such that the inserted 
regulatory element is operatively linked with the endogenous 
mahogany gene in question. Targeted homologous recombination 
30 can be thus used to activated transcription of an endogenous 
mahogany gene that is "transcriptionally silent", i.-e., is 
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not normally expressed, or to enhance ^ ' 
-oogenous mahogany gene ^ ^ , ~ °" ™ 

Further, the overall level of ™=u 

. mano9any gene produ ; c h :rL 9 i:: r express2on 
5 intr a U c ti0 n ef aPPropriate mah09any r ex ; r y e : , 9 n = - «- 
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— .... Pref e r a Uy ::r::: s :::/L; r ::r-; m 

mahogany gene. Alternatively, cells 
cells, can be engineered to express J h * 
"-W then he introduced into a 1^ " ^ 

» appropriate tor the a^elior t on of tL'h P ° Siti ° nS 

^Ptoms. Alternately, cells ^ Z^LZtLZ^ 

2 „ tne appropriate gene regulatory sequences to allow = uch 
expression ln the necessary cell types Such 

secuences are well Known to the s K m! d A ^ re9ul " : ~l' 

9 ene therapy techni OU es ar „ k ^T' ~"- 
in the art, see e a a„h" th ° Se ski "ed 

. see, e. 9 ., Anderson, U.S. Patent No. S..399 34 = 
25 When the cells to be administered are non * 

cellc r-u^,, , ea are "on- autologous 

cells, they can be administered usina w*n i, 

cells from developing. For example, the cells may be 
-troduceo » an encapsulated form which. „ hi l. Jawing for 
an exchange of components with the immediate extracLl , 
environment, does not allow the introduced LlTC^ 
recognized by the host immune system. 
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Additionally, compounds, such as chose identified via 
tecnnioues such as those described, above, in Section 5 4 2 
that are capable of modulating mahogany gene product activitv 
can r>e aamir.istered using standard techniques that are well ' 
£ Known tc those of skill in the art. m instances in which 
the compounds to be administered are to involve an 
interaction with brain cells, the administration techniques 
snould include well known ones that allow for a crossing -of 
the blood-brain barrier. 

10 

5.5. PHARMACEUTICAL PREPARATIONS AND 
METHODS OF ADMTN ISTRATTnxT 

The compounds that are determined to affect mahogany 
gene expression or gene product activity can be administered 
to a patient at therapeutically effective doses to treat o^ 
15 ameliorate body weight disorders, such as obesity, anorexia 
or cachexia. A therapeutically effective dose refers -to that 
amount of the compound sufficient to result in amelioration 
of symptoms of such a disorder. 

20 5.5.1. EFFECTIVE 

Toxicity and therapeutic efficacy of such compounds can 
be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., for determining the 
LD 50 (the dose lethal to 50% of the population) and the ED 50 
^ (the dose therapeutically effective in SO% of the 

population) . The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as 
the ratio LD £0 /ED E0 . Compounds that exhibit large therapeutic 
andices are preferred. While compounds that exhibit toxic 
sade effects may be used, care should be taken to desi-gn a 
30 oelivery system that targets such compounds to the site of 
affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects. 
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The data obtained from the cell culture 
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te °" ^ministration utilized p 

~ ^ "» of the ime „Uo„ the ^ 

therapeutically effective dose can K 
JD from cell culture assays * i» ltl . lly 

10 to achieve a ^ ^^17^ £ ° rmUlat * d ln «^ 

that includes the lc " . " «"*«"«««. ranee 
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liquid chromatography * "^ Performance 

Pharmaceutical compositions for use in a „ „ 
20 the present invention may be formulated aCC ° rdan « ^» 

manner usin g on e or more P^lZTclltyZlZZT^ 

or excipients. Y acce P tat >le carriers 

Thus, the compounds and thf»ir • , 

salts and soivates may oe to^^^T^ 

„ inhalation or insufflation (either th™ 
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lubricants (e.c., rriacnesiuir. stearate, talc cr s 
disintegrants fe.c., pctatc starch cr sodium starch 
glycolate) ; cr wetting agents (e.c., sodiurt iaun-j sulphate; 
The tablets may be coated by methods well known ir. the art. 
; Liquid preparations r cr era! administration may take tne for 
of , for example, solutions, syrups or suspensions, or thev 
may be presented as a cry product for constitution with wate. 
or other suitable vehicle before use. Such licuic 
preparations may be prepared by conventional means with 
pharmaceut icaily acceptable additives such as suspend mo 
agents (e.c., sorbite! syrup, cellulose derivatives or 
hydrogenated edible rats; ; emulsifying agents (e.g., iecithir 
or acacia); non-aqueous vehicles (e.g., almonc oil, oily 
esters, ethyl alcohol cr fractionated vegetable oils) ; .and 
preservatives (e.g. , methyl or propyl - p- nydrcxybenzcat es or 
sorbic acid) . The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as 
appropriate . 

Preparations for oral administration may be suitably 
formulated to give controlled release of the active compound. 

For buccal administration the compositions .may take the 
form of tablets or lozenges formulated in conventional 
manner. 

For administration by inhalation, the compounds for use 
according to the present invention are conveniently delivered 
in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable propellant, 
e.g. , dichlorodif luoromethane , trichlorof luoromethane , 
dichiorotetraf luoroethane, carbon dioxide or other suitable 
gas. In the case of a pressurized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of e.g., gelatin for use in 
an inhaler or insufflator may be formulated ^containing a 
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powder mix ct the compound sultah1o 

lactose or sr «~.-„ suitable powder base such as 

.«...i;:--:::~r,r;j~'\v:; 1 rr""' 

dose containers with an ad * * ampoules or in multa- 
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:v n ::;;r orm *- — - -T^rt^r 

e-9... sterue pyrogen r free water, before use 
The co-pound, may also be formulated in 
compositions , ueh suppositories or v . eent . • Ct * 1 

lon 9 ac tin g formulation£ may ^ -J - - 
20 implantation (for example subcutaneously or * 

or by intramuscular injection. Thus fl 1Mr » K " 1 «ly) 
compounds may be formulated with ^ ^ ^ the 

hydrophobic material (for exam., P°^eric or 
acceptabie oil, or ion el Z7 sI^rT^ *" 

25 soluble derivatives, for example ' " " 

salt. 3 s P ar mgly soluble 

or diit::rT:::: n : h i; y ma " — • - ~— 

forms contai„i„ g ^rC^L"" ZT"^ ^ 
example comprise metal or plastic fll ! * ^ 

"» Pacfc. The P ac k or dispen se r ^ c ~ y b 7 " 3 

instructions for administration -""Panied by 
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6. EXAMPLE: GENETIC AND PHYSICAL MAPPING 

OF THE MAHOGANY LOCUS 

In the Example presented herein, studies are described 
which, first, define the genetic interval on mouse chromosome 
5 2 within which the mahogany gene lies, and second, 

successfully narrow the interval to approximately 0.2 9 cM. 
Further, the physical mapping of this interval is described. 

Mouse crosses were performed to obtain homozygous mg/mg 
mice. First, LDJ-Le-mg mice were crossed with CAST/Ei mice. 
The Fls were back-crossed with LDJ-Le-mg mice and the 
0 resulting litters scored for coat color. Mice showing coat 
color of mg/mg homozygotes were genotyped to using D2/NDS3 
and D2/MIT19 markers to identify meiotic events. Mice 
showing recombinant events were fine structure mapped using 
various markers shown in FIG. 1. All genotyping was 
5 performed using PCR-SSLP and then analyzed using PAGE. 

After 2300 meiosesis, the mahogany gene was mapped to a 
0.99 cM interval FIG. 1. This corresponded to an interval 
width of 700 kb. 

Physical Mapping of the Genetic Interval • The 700 kb 
mahogany region on mouse chromosome 2 is shown in FIG. 1. 
Genetic markers, clones spanning the region and open reading 
frames in the interval are shown in the figure. 

7. EXAMPLE: IDENTIFICATION OF A CANDIDATE 
MAHOGANY -GENE 



In the Example presented herein, a gene is identified 
within the cloned DNA described in the Example in 'Section -6, 
above, which corresponds to a candidate mahogany gene . 

Clones spanning the 700kb -region were sequenced, and open 
reading frames were identified and analyzed through this 
interval . Nucleic acid sequencing was performed -using ABI 
sequencers and the manufactures recommended procedures. Many 
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analysis, was used with c tn. e( /--- " V " SSCT 

5 pnenocype mice tc :de-^f- , ■ mc;i ^ Gn -'' ■ 

-than thus interval c^!.^^ " the . ^ Racine frames 

P m „ - * C mUt£tira an mc m ouse 

mutation was found — one o* * >, 

= Provide th . genomlc c3m 8 ^ J ^;. n : C : - 

mutator, „G. e show, <v 'T '""^^ *•» 
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The mutator, „ mciJ ~ .\ ^ °' tn * ■«*» ■* 9.n« , 



wnich results -.r r ' " sequence 

chromosomal locs^rr * baSec °"- tne 

Ce " c " cnc mutation idertsf, r 

Provided in F:oure ' and * — Co^o,., the cDNA 

15 -nta.ns the contias .rov^c DNA. which 
gene/locus. " * * ^ >^ure - represent the rag. 

Further anajysis o^ enwi ,-o 
provides an analysls Qi th£ t ^ ^ 7 

20 va riants . denoted akm3003 and akml0O r i.° f F g h : r : 

provide the nucleic acid an ri „ • Fl 9 u "s 8 and. 9 

ends of these -lie ° ■" eld -»*-*• <* the S . 

Protein ^T^"^ 

9 en e: . long form (FIG 3 ° f Che h — °rthc lo9 . of the ^ 

~> - — h is shown .."L^n^r - pw ~ 

8 • EXAMPLE : CHARACTERIZATION OF 

THE MAHnqANv r-r-rrp 



in the example presented herein, the nucleic acid 
sequence of the mahogany gene transcript identic T 
example presented in Section 7 , above , ~. ^ ^ 

& usea to generate 
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Northern analysis data which characterize the expression of 
the mahogany transcript in a number of tissues both of wild 
type mice, and of mice exhibiting the mahogany phenotype . 
The results presented in this example are consistent with the 
5 mg gene being the mahogany gene. 

For Northern analysis, polyA RNA was isolated from wild- 
type and the original mg mutant, mg3J and mg-Lester mice and 
utilized from the Northern analysis following -standard 
protocols. Northern blots prepared from this mRNA was 
iq hybridized with a probe obtained from sequences common to the 
akml003 and akml004 sequences. Specifically PCR primers 
TTCCTCACTGG and GGACACACAG were used to amplify cDNA from the 
akml003 sequence which had been radio! abel led by random 
priming using a Gibco-BRL kit according to the manufacturer's 
recommended protocol . 
15 An mg transcript was found in all mice examined in mRNA 

isolated from brain (minus the hypothalamus), kidney, heart, 
testes, liver, skin, and hypothalamus. No expression was 
seen in muscle. 

In a Northern blot run on RNA samples from mahogany 
20 mice, the mg transcript was found to be expressed at a 

reduced level in all tissues in mRNA isolated from mg3J mice, 
as a varied size fragment in mg-Lester derived mRNA, and at 
different levels and sizes in original mg mutant mice derived 
mRNA. 

2s These results are consistent with the mg gene disclosed 

herein as being the mahogany gene. 

9 . EXAMPLE : EFFECTS OF THE MAHOGANY <3ENE 

ON GENE TIC AWT) DIETARY OBESITY 

This section describes experiments which examine whether 
30 the mg gene acts specifically within the agouti pathway. 

Specifically, these experiments test whether mg can suppress 
the obesity of other monogenic obese mutants as well as 
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of mrkers developed durino f h. • . ' ever growing number 
totaled 2 „ makers. ""'^ «"HV V ■ 



- 84 



BNSDOCID <WO 0O05373A2 I > 



WO 00/05373 PCT/US99/J6484 



9 -2 . RESULTS 

The murine mahogany (mg) gene is known to act in a 
dosage dependent manner within the agouti pathway, to 
compensate for the agouti overexpression and for lack of 
5 signaling from the nul allele Mclr (Miller, K.A. et al . , 
1991, Genetics 245:1407-1415; Dinuiescu, D.M. et al . , Proc . 
Natl. Acad. Sci . , in press ; Robbins, L.S. et aJ . , 1993, cell 
72:827-834) . The phenotype of mice homozygous for both mg 
and a null allele of Mclr (recessive yellow, Mclr 5 ) is 
10 yellow, the same as the phenotype of Mclr* /Mclr* -mice, 
indicating that mg is not acting downstream of Mclr. A 
similar experiment was performed with obese Mcr4 knock out 
mice (FIG. II). For both sexes, all the animals homozygous 
for Mc4r-/- were approximately equally obese and were heavier 
i5 than the mice wild-type at Mc4r independent of the genotype 
for mg. This data strengthens and confirms the AfcJr data 
previously published, strongly suggesting that mg acts at or 
upstream of both melanocortin receptors. 

To test whether mg acts specifically within the agouti 
pathway, experiments were performed to determine whether mg 
can suppress the obesity of other monogenic obese mutants of 
the mouse and whether it could suppress diet -induced obesity. 
Appropriate genetic crosses were set up to product mice 
segregating mg and one of the mouse obesity mutations Cpe*", 
tub, or Lepr** such that all combinations of homozygous and 
25 heterozygous animals were on the same mix of genetic 

background. No suppression of obesity was seen for any of 
the monogenic obese mutants (FIG. 12) lending credence to the 
assumed specificity of action within the agouti pathway. To 
ask whether mg can suppress diet induced obesity -C3HeB/FeJ- 
30 mg JJ and C3H/HeJ mice were placed, at weaning, either on 

normal chow having a physiological fuel value (PFV) of 3 .£3 
kcal/gm with 9% fat, or onto a high fat -diet having a PFV of 
4.53 kcal/gm with 42.2% fat. Food consumption and .body 
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weight were measured weekly. Converting the ™m«, f * „ 

consumed tc r- s i^ ies u 9 Srams of food 
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" K<_ai/weeK and -8t kCal /wo^u 

respectively. Desna i-<= fh. ■ , K ^a-L/week, 
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10 significant difference in „ e i ght (FIG . .^"V"" 5 ' 
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sl9Dl£lCMCC bet — <* th, mice on either diet 2s " cal 

BXPERMENT. W>I1|S „, SEQUENCING 
OF THE MtHOGEKV mm ^ 

15 This section describes experiments wherein tho „ ■ 

mahogany 9ene was 3 ene t ica lly and ™ 
ap P roximate ly 0 . 6 cM interval and then sejencef 11 
murine „ 9 sequence obtained was then used to isoiate and 

r en " hUman "» -d in situ ana ^s 6£ . 

20 » 9 express.cn in Mouse tissue are aiso described and 
sequence motifs of the predicted „ G poiypaptide are 
discussed. e 

10 MATERIAT.g AMD m^nr^r. 

25 fro. f?T £ ^ J5S2EiQ3: 36,000 individua! sentences 

fro. the region were spared by BLAST (Altschul. , P ^ ' 

* 1 - 1990 ' 3 - ™- "*.«»-«., to pubiica; - 

seouence databases and anaiyzed usi„ 9 GRAIL (GU aI 7 « L 
to identify potential coding sequence. ln addition C 

:~ fr :i rrrv* — * ^ ^ 

Green P 1090 ^ 175 185 ' ^9 B. and 

re*n, P., 1998, Genome Res. 8:186-194; Gordon, D . €t a2 . , 
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IPSE, Genome Res. 5:195-202), and the resulting contigs were 
also analyzed using BLAST and GRAIL to aid in gene 
prediction. This data was displayed in ACEdb (Durbin, 
Richard and Mieg, Jean Thierry, 1991, A C. &l*gans Database, 
E Documentation, code, and data available from anonymous FTP 
servers at 2irmm.lirmm.fr, cele, mrc-lmb.cam. ac . uk , and 
ncbi.nim.nih.gov) to further visualize predicted exons and 
their relationships to each other. 

10 Northern Blot Ana]y.si P : PolyA+ RNA was extracted from 

the tissues indicated from wild-type, C3H/HeJ and the -three 
mutant alleles of mg, C3HeB/FeJ-/n^, LDJ/Le-zng, and C3H/HeJ- 
mcf, according to the manufacturer's instructions. RNA STAT- 
60 (Tel-Test, Inc., 1511 Sounty Rd. 129, Friendswood , TX 
77546) was used to isolate total RNA. PolyA+ was isolated 

15 using Poly (A) Pure™ mRNA purification kit {Ambion, Inc., 2.130 
Woodward St. #200, Austin, TX 78744). 2 of each mRNA was 
separated on a 1% agarose- formaldehyde gel, transferred to 
nylon, and hybridized with a probe for mg corresponding to nt 
990-1406 of the murine cDNA sequence with Rapi-d-hyb Buffer 

20 (Amersham LIFE SCIENCE, Gaithersberg, MD) . Filters were 
washed with O.llx SSC, 0.1% SDS and exposed to KODAK X-omat 
film overnight. 

10 .2 . RESULTS 

25 A positional cloning strategy was undertaken to identify 

the mg gene. Multiple genetic crosses were set up to produce 
second generation mice (n-2437) segregating mg whi«ch were 
used to genetically localise the mg locus (FK3 13B) . when 
the genetic map critical interval for mg was resolved to 

3q -0,6 cM physical mapping was initiated. Approximately 1 Mb 
was contiged with 30 BACs (FI<3. 13C) , most of which were made 
into random sheared libraries for shot gun sequencing. At 
completion of the project it was estimated that «S% sequence 
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hypothalamus. Overall hybridization in LDJ/Le-mg/mg looks 
equivalent to that of wild type, and the C3HeB/FeJ-.jn^//n^ 
shows an overall reduction of expression. Close examination 
of the hypothalamic region in both wild type and. mutant 
5 alleles revealed differences in the ventromedial hypothalamic 
nucleus ( VMH) . Both C3HeB/FeJ-^/^ and the LDJ/Le-ing/ing 
have reduced VMH expression "(FIG . 15) which is particularly 
interesting as many neuropeptides and receptors known to be 
involved in body weight regulation are expressed in the VMH, 
including Mc4r. 

Initially, two overlapping mouse cDNAs of 1051 bps and 
2419 bps were identified. Using these cDNAs as a starting 
point it was possible to build over 7990 bps of human 
sequences, using both the public EST database and an in -house 
database, as well as identifying one cDNA clone from a human 
liver library. The 23 ESTs used in the contiging are listed 
in Table I below. Using the derived human sequence, it was 
then possible to estimate the intron-exon boundaries within 
the mouse genomic sequence. These were verified by PGR 
amplification and sequencing. m total, 4079 bps of mouse 
20 sequence was obtained, of which 4011 bp are coding sequence. 
The mouse genomic locus spans over 160 kb, and lias 31 
identified exons, at least one of which is differentially 
spliced. 

25 TABLE I 

Gene Bank Accession # Clone ID * clone San^o 



15 



30 



NA 


NA 


Human Endothelial -Cell 
(MPI) 


AA062169 


482948 


Soares mouse P3NMF19.-5 


NA 


NA 


Human Liver (MPI) 


AA350292 


151062 


Infant Brain 


R8 7-660 


194640 


Soares Fetal Liver Spleen 
1 NFLS 
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10 



15 



20 



T69367 

T92696 

H11351 

AA350293 

AA297697 

AB011120 

AA297214 

AA298732 

AI076479 

AA771958 

R84298 

D81046 

AA378603 
D60710 

D20236 
AA345684 
H45413 
AA044305 



82898 

118881 

47626 

151062 

149184 

NA 

129808 
184690 
1676623 

1359202 

194640 

1178923 

183010 
962349 

pml235 
147210 
182870 
486349 



Stratagene Liver 

Stratagene Lung 

Scares infant Brain I NIB 

Infant Brain 

Fetal Heart II 
Human Male Brain 
Embryo, i 2 week I 
T- Lymphocyte 

Scares Total Fetus Nb2HF8 

SSST Parath ^<* tumor 
Scares Fetal 

Human Fetal Brain 
iTfujiwara) 

Synovial Sarcoma 

Clontech Human Fetal 
Brain (#6535) 

Human Promyelocyte 
Gall Bladder I 
Soares Breast 3NbHBst 

NbSr Pre9nant ute — 



The mutant mahogany alleles were alco 
chewing all intron-exon boundaries TT^TTT^' 
nt was found i„ the coding aeguence of " ** del6tl ° n at 28 ^ 
CSHeB/Fea-^/^ which intr Z^ ^ ^ ^ 
937 ' «*~ 3. of the deletion T° * 

-suit in a seriously truncated protein 1 T ^ 
interesting domains, as discusse^T * 
30 the same allele that -J^T^T? ^ ^ ^ 
The combined Northern blot ana lTl ^ ^ «~*- ^els. 

nalyS1S ' 2n hybridization 
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analysis, and sequence analysis of the mutant mg 3 * 7 allele 
strongly suggest that this gene is the mouse mahogany gene. 

The 4 011 bp of open reading frame CORF) of mouse MG 
predicts a 1336 amino acid polypeptide with molecular mass of 
; 148,706 D (FIG. 17, top sequence). BLAST searches of the 
NCBI and SwissProt protein databases identified two human 
paralogues with a similar modular architecture (KIAA05 34, 
Genbank accession no. 3043592; and MEGF8 , Genbank accession 
no. AB011541) , as well as a C. elegans homologue (YCS1_CAEEL, 
Genbank accession no. Q199B1). 

Another human protein, At tract in or DPPT-L (Duke-Cohen, 
J.S. et al., 1998, Proc . Natl. Acad. Sci . U.S.A. 93:11336- 
11341), appears to be a 1198 amino acid residue, 
approximately 134,000 D, secreted splice variant of the MG 
polypeptide. An alignment of the predicted MG (top) and 
Attractin (bottom) amino acid sequences is shown in FIG. 17. 
Attractin has not identified as being involved in the 
regulation of body weight. Rather, the protein is reported 
to mediate an interaction between T lymphocytes and monocytes 
that leads to the adherence and spreading of monocytes that 
become foci for T lymphocyte clustering (see Duke-Cohen et 
al . , supra) . 

Searching the MG polypeptide with the SMART domain tool 
(Schultz, J. et al., 1998, Proc. Natl. Acad. Sci . U.S.A. 
55:5857-5864) revealed sequence motifs that -may provide 
further clues to its biological function -(FIG 1€S, FK3. 17) 
The single transmembrane spanning MG protein ha« a large 
extracellular sequence of 1289 amino acids containing three. 
EGF domains (Nakayama, M. et al . , 1998, Genomics 52:27-34), 
two laminin-like EGF repeats, a CUB domain <Sork, P. and 
Beckmann, G., 1993, Mol . Biol. 231:539-545), a C-type lectin 
domain (Drickamer, K., 1995, Nat. Struct, aiol. 5:437-439; 
We is W. I., and Drickamer, K. , 1996, Ann. &ev. Biocbem. 
55:441-473), two pLexin-like TOpeats (Waestrini, E. et al . , 
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1996, Proc. Natl. Acad. Sci . i>3- 6 74- 678 ) „ • 

1994, J. Mol. Biol. 236:1277-1262) m u- / *' F " 

also fluently occur in 9lycosylated p rote J\ n 7 c d ™ 

that KG is heavily glycosylated that ^ » ^-ly 

interactions are essential for its £„„,„ 

containing proteins have been ZZlZ 
multiple consecutive domains ^ ' h " W 

6-sheet Kelch motifs form ^JZTjT""'"" fol »- t »»— « 

et a2., supz-a) . Unlike the other *«o i ' ' 

t- .peat is less ^T^T^ £T ' 

recognized as a triple repeat in „ 

has similarity to MET (BorK P a ^ "~ PleXi " ^ 

20 B iol. „ a , B „-5«s>. Since chin t h - ^ * ' 

has been found in S met gL fa mil T^"* ^ 
si 9 nal via tyrosine U^T^T^^T " 

25 that there is an eight amino acid \ It TZl^ 

conserved in the four proteins shown in PI<3 ! 
-use. and c. ele^ans. The conservation c f 
such widely evoiutionary diverge! ^! 

a --- ^in. POS si ble r^-^J^«- 

The fuU-X.ngth .3 polypeptide is J^"^ 
^rane-spanning protein „ lth sxtra ^ll7^ B 
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that may have a binding or gathering function as well as a 
highly conserved putative signaling motif in -the -cytoplasmic 
tail. 

5 

The present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 
as single illustrations of individual aspects of the 
invention. Functionally equivalent methods and component's 
are within the scope of the present invention. Indeed, 

10 

various modifications of the invention, in addition to those 
shown and described herein, will become apparent to those 
skilled in the art from the foregoing description and 
accompanying drawings. 

All publications and patent applications mentioned in 
15 the specification are herein incorporated by reference to the 
same extent as if each individual publication or patent 
application was specifically a*id individually indicated to be 
incorporated by reference . 

20 
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WHAT Tfi CLATMRn tc. 

1- An isolated nucleic acid molecule comprisinq the 
nucleotide sequence of SEQ id no- 1 (Flc ^ 

(PIC BA), SEQ ID NO: 10 (FIG SE ° ID N °= * 

5 SnQ xD NO: 14 ( PIG . 18A) , SEQ ID NO: 16 ( PIG 19A , I 

NO: 18 (FIG. 2 0A) . } ' ° r SEQ 1D 

2- The isolated nucleic a<--iri -. 
, . ieic acid molecule of Claiir i 

WherSln -^cule composes the nucll ide 

iq sequence of SEQ ID NO: 1 (FIG . 2A) . "cleotide 

3. The isolated nuclei acid molecule of Claim i 
wherexn the nucleic acid molecule composes the nuc l eo ide 
sequence of SEQ ID NO: 8 (FIG. 8A) . nucleotide 



15 



4. The isolated nuclei ^^aa 

lc acid molecule of Claim l 
the nucleic aci<3 - - 

sequence of SEQ ID NO: 10 (FIG. 9). nucleotide 

5- The isolated nucleic shh 
, c acid molecule of Claim i 

2 0 th e nucleic acid ^ - • 

sequence of SEQ ID NO: 12 (FIG. 10). 6 

6. The isolated nucleic acid molecule of Claim 1 
wherexn the nucleic acid molecule comprises the nucleotide 

2s sequence of SEQ ID NO: 14 < FIG . 18A) . nucleotxde 

7. The isolated nucleic acid molecule of Claim 1 
wherexn the nucleic acid molecule comprises the nuc eo ide 
sequence of SEQ id NO: 16 (FIG . l 9A ) . cieotxde 



30 



8. The isolated nucleic acid molecule of Claim 1 
wherexn the nucleic acid molecule comprises the nuc^de 
sequence of SEQ ID NO: i 8 (fig — • nucleotade 
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9 . A vector -comprising the isolated nucleic acid 
molecule of any one of Claims 1-8. 

10. An isolated host cell genetically -engineered -to 
5 express the nucleic acid of any one of Claims 1-8. 

11. An isolated nucleic acid molecul-e comprising a 
nucleotide sequence that hybridizes to the complement of SEQ 
ID NO: 1 (FIG. 2A) , SEQ ID NO: S <FIG. 8A) , SEQ ID NO: 10 
(FIG. 9), SEQ ID NO: 12 (FIG . 10), SEQ I'D NO: 14 { FIC . 18A) , 

10 

SEQ ID NO: 16 (FIG. 19A) , or SEQ ID NO: 18 {FIG. 20A) under 
stringent conditions comprising hybridization in 0.5 M 
NaHP0 4 , 7% SDS, 1 mM EDTA at 68 °C. 

12 . A vector comprising the isolated nucleic acid 
15 molecule Claim 11. 

13 . An isolated host cell genetically -engineered to 
express the nucleic acid of Claim 11. 

20 14 . A method of producing a mg gene product oompri-sing 

culturing the genetically engineered host cell of Claim 10 -so 
that the mg gene product is expressed in cell culture, and 
recovering the mg gene product from the oell culture. 

15 . A method of producing a mg gene product -comprising 
culturing the genetically engineered host <:ell of Claim 14 so 
that the mg gene product is -expressed in cell culture, and 
recovering the mg gene product from the -cell culture. 

16. An isolated gene product -encoded by the nucleic 
30 acid molecule of any one of Claims 1-8. 
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17 Th* isolated gene product of claim 16, wherein the 
gene product comprises the amino acid se.uence shown in 
Fxgure 2-B {SEQ. id N0 . 2), Pigure 8B (SEQ . ID NO 9) , Figure 
9 (SEQ 1DNG . 11)f Figure 10fi (SEQ> Pi^e 18B 

5 (SEQ. ID NO. 15), Figure 19B fSEQ t D ^ 9 " B 

(SEQ . ID NO. 19). ?; ' ° r Fl9Ure 20B 

18 . An antibody that immunospecif ically binds the 'gene 
product of Claim 16. . 9 

" mammal"' * nKth0d dia9n ° Si ^ * -ISht disorder in a 

mammal, comprising: measuring the level of mg gene 

expression in a patient sample and comparing the level to 

levels il" ^"rV 3 ^ 16 ' SP that " * oetween the 

levels is detected, a body weight disorder is diagnosed. 

mamma! 20 ' & ""^ dia9n ° Si ^ * ***Y -ight disorder in a 

mammal, comprising detecting a mg g ene mutation contained in 
the genome of the mammal that correlates win „ 
disorder. prelates with presence of the 



15 



20 



mammal diagnosing a body weight disorder in a 

mammal, comprising: measuring the level of activity in a 
patient sample and comparing the level to that of a control 
sample, so that if a difference between the levels is 
25 detected, a body weight disorder is diagnosed. 

22 a method for identifying a compound that modulates 
mg activity, comprising: 

contacting a compound to a cell t-v, al - ~ 

r a ceil that expresses a ma 

gene; ^ 

measuring the level of mg gene expression in the 

<:ell; and 
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gene 
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c . comparing the level obtained in <b) to mg gene 
expression level obtained in the absence of the 
compound; 

such that if the level obtained in (b) differs from that 
5 obtained in the absence of the -compound, a compound that 
modulates a mg activity is identified. 

23. A method for identifying a compound that modulates 
a mg activity, comprising: 

contacting a compound to a <:ell that -contains a mg 
polypeptide ; 

measuring the level of mg polypeptide or activity 
in the cell; and 

comparing the level obtained in (b) to the Level of 
mg polypeptide or activity obtained in the absence 
of the compound; 
such that if the level obtained in (b) differs from that 
obtained in the absence of the compound, a compound that 
modulates a mg activity is identified. 

20 24 • The method of Claim 22 or 23 wherein the compound 

identified is capable of treating a body weight disorder. 

25. A pharmaceutical composition conpri^ing the 
compound identified by the method of claim 24. 

25 

26. The use of the pharmaceutical composition of Claim 
25 for treating a body weight disorder in a mammal. 

27. The use of the antibody of claim 18 for treating a 
body weight disorder in a mammal . 

30 

28. The use of a mg ant i sense, ribozyme or triple helix 
molecule for treating a body weight disorder in a mammal . 

- 97 - 
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GAATI*CCGGGCGAAGGGGAGCCGGCGTGGGGGGTTGTCTATGTCjI*TCGCTGGGOGGCGG 
GOlOGCAGCACCGACGCTGCA<XXXX^CAGGGGOCr^^ 



GCCGCTGCGGTOGCGGCGGCXXrTGT^ 

<K=CGCTGCAACCCTGGCACCGGCX^ 

CTIX^SGACATCTGTCTCACGC^ 

<CTACAGAATAAGTTCAAGAGTAACXn^ 

AGAGTCTirTTGGGAAAATTTTAGCTXS^ 

CTGOGAATTATAAATATAAGACGAACrrGCACATGGCTC^ 

OCATTTTGCT ACAGAATGT AGCTGGG ACCATTTATATOWTATGATCX5C3GACTCA 

TTTAGTGGCXrTCATTOrTCCTGA^ 

TGC^TirrTirrCAGTGATGCTGCTTAT^ 

CTCAGCXXXSAGGAOACT^ 

TCGTGTGACATTCCTCACTGTAO^ 

GCTOCPCXrrTTCCTCACTGG^ 

ATATTCTGATTTAAAGCTTCCCAXaGCXr^^ 

AOXWTOIACCATTCAGATTACAGCAT^^ 

TC»ACACnX5TOGTTOTAAGA^ 

ttc^c^usggaacgtgaccaatga^ 
aaggatcagtatgcactcx^tcgacactcagcac^^ 

TCQCrrCATrCXXX^ACTCTATGGATATATAJ^ 
TACTCJKXXSWXrrCTTCrrwy^ 

GGACCATTCTrAAGGACAGCCX^TT^^ 
AOOGAACAGACACAATGAC^ 

ACACXaCCATCTATgPGTix^GCG^ 

TOCACAOCGCAGTt ^ 
AOCTCCTOCXSAGraXX^AG^ 

CCCTCTIXiAACX^CAGCaKX^ 

TACTXSCAATAAGAAAACCAGCTCCAGGAGCT<yrGCCCTAGACCAGAACTGOCAG 
TaXXCTGCaXSAAAATATCTGTGGCA^ 

CAATTATGACAATQCTAAATTGTCCTGTAGGAACCACAATQC CT 'TTTT G 

TTTCTCCTITAAGCAGCTTCGATTA^ 

GaAAGATCAATGTCmrrTACTCGT G CTX ^ 
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riXXXXATTTATGCTGCTC^^ 

CTCTGGCATTACXXrTATTGGGATGGACCGCT^ 

TAAAACTGCAGCATTTTACCTTTAAAT^^ 

ATGAAGAATTGCCTGGAAAGATGTTTTCAAGGAATTTGAACCATAAA 

GAGACTX^CCTCTCATAAAG C TT ^ 

AGCTCCCCCATCXXTTGCTACAGAACACAG^ 

CA^CAAGGCCAAAGGGAGGCCCCTGGGTAl^^ 

CCTAGGCCAAAGAAGTCTCTTCCCXM^ 

ATGTCTTATTrrin w riCCTAAAAGATAATCri^ 

AAAGGGAATGGCAGGGAGTAAGAGGCGCTGGGCT^ 
GCGCGTC^CAAGAGAGCCTCT^^ 
AGTCrrCCAGAATGTTCAGACATTCGGAGT^ 
TCTGACTTTCACCTAAGAT^^ 

ACCCAAGAAACGCATCCCCATTGTGTGATGTGTTCAGATGCA 

TCTGTCAGAGTGCACATGAAAAATCAGGCAG^ 

CTGCCGCTGCCCTTGGCCCT^^ 

CAGTCTTCrrCCTGAGAAATGrTTCT^ 

CTlXrTCCATTTTTTGTAGTr^^ 

TC«^CTAGGGAGCTAGTT^^ 

CCTCCTC^CATCCCTGCAC^C^^ 

TCXX^TTCAAGACTTCAGACAO^^^ 
GC^AATCCTGCaX^Vin^GATCT 

CATGAACATCTTTGATGCTTCCATTTCATTTATTCATCCATOCATTCAA 
ATGCTAGGCTAGTGACTGAGATGTATLAAATAGATTrra 

CTOiGCXX^TGCCCTCCCTAC^^ 

Q^ASCCAQCCCCAGCAAG^^ 

GAGGAAAATTXXX^CCTC^TCTTOTAAAGG 

TTGTTTGGATTCAAGCTTAG^ 
TAAACTATTTCATTGCGGGGATTGTGQCTIXrra 
AATAACAGCTAATTTAAGCAita^CAAGltf 
ATAAACAAAAgTAAATACTATAATACAAACT lXZXJTT^^^ 
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HHl^ATECSWDHI,^ 
HCP^CSCKCHCXSSNSSSAVB^ 

^TffiLVOFFVTFFSCFI^IJ^VAAVVWKlJ^OSCWASRIUlEOL^ 
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A^ACTGCCATTACTXTTTGOTGACCCCAGAGGTCAAT^^^ 

AAAGCCCTATGACTACTTGGTGTCTCTGGTOCTAACCCTAGTCGTTGGGG 

CATCTTACTGTATCCTX^CTrAAGGAAAGACATCCAGGCTCCCCRCr^ 

"«««^^ 

^^CAACArT^CTCK^aA 

^ACAT^ACATATAArrAOAOCXX^c,^^^ 

^^^^WWACCTATCAGTATTACTAAGTTGCAAATGTGCTCAGCftGTAAAT 
^T/ACATACATAGGCAAAAAGAAAAGTCTCAGGACAGCCTGCCTCAGACT 

A^^n^ATATTTTTCTCCATATAATTTAAAAGATTOCTCTTTCAACATl^ 
^^^roCAAAGCAGTAlX^TTCTGTTTCTATATC?rGTGTCCTTCCACATT 

^^^^^ CC,TGCT GCTACCATOCTT^TrGOCATCAT(riTCATAGACT 
TACTCTC^AAACTCn.AAAT^ 

*^^^AOCAAACTOOCTAATAGTCCTACTAATTTTATC^TOAGTCCTAOT 
OC^WATAATCACTAGAAGAAAAAATTTCCAGGOCATAAAATTAACATOS 

CTAATATGATAATGTATATTCTAOCTTCAAAAAAGCACAAATAAGACTTC 
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AAACCCTGGGAATTGTTAGACAAAGGCC/vTTTAATACTAATAAGCTATAA 

ACTGAAACCATCTGATATATGAAAACTATTAATAAAATCAAGATAAAATA 

AC<X?CTATTTATATAACTTACTATAT7vCCTAAAGCAAAATATCAAAGAAA 

G T ACCTTAAAAAG AT AAATT ATTCTT ATTTTG ACAATGAATTCTTTGGGG 

CGTTAAATTGTAGAATATCAACACATATC7VAGAAAGTTTAGAAGAAAACT 

ACCAAAGTTTAAACAGACTTTCCTCGGTAATTACTGG^ 

TTTTTTTTTTACACTGCAGTTTTTC^ 

GAAGCACTTACCACCACTCTCAGAGCTGGAAATGGCTCAAAGGGC^^ 

ATTACAAGCCTGGCAACXnKiAACCAAATAC<X!AAAACACTTGCAAAGGTG 

AAAGGAGAAAACTAACTCCAGGAAGTTCTOTTT^^ 

CCACTGTATACACCCCCTTATATACACTCAGTTACCATAAATAAAATGTT 

TCATTATAAAGACACTTACGCTAAAACCATGCTGTAATCTGAATGGTTGA 

ACATATATCCGCCAACAACCCACATTATATTTCCATT^ 

TGAGAGGCTCTGGGAAGCTTTAAATCAGAATATTCTTCTCGAGTCCAAAA 

AGACTGGTTAGCTGGCACAGGAATTGAGCATGCAGGACCTAAT 

AAAAAAACAACAACAACAACAAATAGCrTTCACAA^ 

TT AT AGT ATTCCAAGTTCCAATCT TiAGTGCAAAGAAT ATTT AAAGACTTG 
TGGGGCTAGAGAGATGGCTCAGTGGTTAAGAAAACTGACTGCTCTTCTTG 
GAGGTCCTGAGTTCAAATCCCAGCAACTACATGGTGGCTCAGAAOCATAT 
GTAATGGGGATCTGATGCCCTCTTCTGGTGTGTC^ 

GTACTCACATGAAATAAATAAATTAATTTTTTAAAAAACAGACCAGAAAA 
AAAAAAAAAAAAAAGACTTGTGTTTCCTTTAGCACTTAAGCGCAAACATC 
TTTAACTTGTGGGGTTTTAAAGGTTTTTA<1^ 

CATGTATGCCTATATACCACTT<XrTIK5CTTGGTAOCCAATGATGTCAGG^ 
AAAGGCATTGAATCCCCTGGAACT AGAG TT ACAGATCTTA^GAGCT ACTT 
TGTGGATGCTAGGATCAAACCTCAGTCCT^ 

TTAACCAAGAAGCCATCTGCTTAGCAGCTAACATGJU5TTTTTAACTTACT 

CAAGATACAGACCAAAAOGAATCACTOCCTTATAAAATTTAATACTACAC 

ACTTTCTGATAATTTC3GCAATTTCTGATAATCAGGTTAAACTTTTTTAGA 

GGTAAAAATCTTGCTGAAGCAACATTTAGTAfiAAAOGGTAGACCAAGOC^ 

TTATTATATTAACTCATGTGGAAAAGGCATTAGGGTTCAAATATAATGAC 

AGATCAAAATGGATCTTCTQGCAAGTNDCAGGOGCTGAATAGA 

ACAAAGGGAGAATTGGACAAACTAAAAACATTTACATGAACACTO 

CTGAGGACCTAAGCATAGAAQGAAAATCACTAAACCAACX^ 

CCTCAATAOOCCAGGGAATTGCCTACAGTAGCTTAGTAOOOOGTTCTCTT 

GGGTAATGGCACTAGATGAGAGCACTGJ^^CTCTAAGGAACGCTTOTCCT 

CCTCTCAGCTTGAGTCTCTX3CTTCl s CTATCACCAGAGCATGT?COCTAAT 

TCCCACGAATGACTTGCAAAGGATTTgK^^ 

CACATAGATAACAACCACATATATGTAAATTCAAAGAATCTGAATAAATG 
GAGATGAATGCTTAAATGCCACCTGATACATGATTAACATAAGGeGTATG 
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GAACTTTCTGTGAG 



GCTCCTAAAATAAACTCCCTACAGTTCACTAACTCA< 

GCACCCTCTGGTATCGCTTGCATTACAGATGCCT^^ 

AGTTCM'CTGTACAGTGAGGAATGTCACAOGACTCCCCTTTCCAGTTTTCA 
GAACATTCACACTCAACAGCGCTOCTOCTGTTACTGCTCTTACACTCTCC 
'^•GCCTGAGCAATTATTCGGACACATXSTCAAAACTACAAAGACAGGAGA 
AAACGAAGTCAACAATTTCAACTAAGCAACATTGCAACTAATGCAGACCT 
'"^CTCCTTCAGTXTAAGTTCAGTTCATTl^GCAAGTGTGACTGCAGGACTT 

GCTCAAGTAATGAAATCAAATCAACCTTGCTGCATTCACATATGAAGAAG 

GAAGAATAAATAACTCACAAAGTTAGAGAAATTACAAAACAATA<»ACATT 
TGTGCAAAATCACTTAC^CTTAGCl^GACTX^^ 

^TCTGGTAGCTCATTAGTAAAGAGTTCTACAAAAGCAGCAAGGltAl^ 

CTAGGAAGTG<^GGAAGGAGAGGAAGCCAATGAGCTGCCAACATTCACGG 

TATACATTTCTCTGTAAAGATTCTGAGAATTAACAGAATrTAAGATTATT 

TTCCAGTGATGTAGTTAAAGGTCTTTAGTAACTTTTATCAGCTTAGAAGG 

AGAAGAGCAGTTAACTTCRT^^ATGAGTTTAAGTGTCTCATGACTTAAGA 

TAACAGTTTTGCTACAATTTGAAATGCCATACITK^GACTTTTTAAAGGG 

GTGCATTAGTGGACTATTACAATAGCTTAAAAATATAGATTTCTCCTACT 

*^^^^^*^^*^ ,5 ^*^^*^''^^'»TCTTTATTAAATTCACTTAGCAAA 
ACTCCTGACATTTTCTTCC*G<^GC^^ 

GATCCTCAGTGACAAGATCTA<lAAAGACCAAGAACrrGTG<3TCCCAACCAG 

''•GGGCTGATATTTGTTTAACCTrrTAGCTCCTGTTTCrrCAATrA'TXSAA 
AAAA ^ AAA *AAGAAGW 

^^*^^^*'"^^^"'^'"^^''CCCTCCAATATATTAGTGTTATTAAGTCATT 
TGAGTAGTATCACAAATATTAAATCTAAATATC'rTACTTGTAAGTGATAT 
TAAATCCAGTCAGATTATAAGCAG<»TCAC«SAAAAAAT^ 

''CAACAGAAACTGTCTATATGTTACTATTW^TOCTGCTXSCTCCTOTTCC 
AACACACACrOTAAATGTGACTCTAGCTGGOCTCAAATrCACAGACOCAC 
CT *'CTTOCACCTCCTGGGTrATAOGCATGCGCTACTATGCCCAACA'TCTA 
.AAAGGATTTGAAATCTATGACTTTBATTGAAT 1 1 1' l> X' XT i'lSj I'lTTT 

AACAAACrTTATGAATTGACW«»GT^^ 

TACTTTACATATITTTGTAATATGTATCATATAATCTTTTrAATGTATTT 
TATAGATGTCTTATATAAGTAAAAATAGAAAAGTTTACTXIATTTATAATC 
CTTATACTATTAGCTTTCAGAOCTATTrXTGTTOTTAAACTG<^AACACA 
W^TATGTTTATAATTCACAATAACCACTGCCACTGAAGGTGOCAAAGGC 



^CCCTAGAATCrCAGTAACAACCTAGTOOOTAAT- 



'ATTTGAAGTPTTGGAT 
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GCCAGTAAATTCATGTGTAAAGATTTATTGAGTAAGTGACTACCAGCGGG 

ACAGTGGTGGTGCACGCCTTTAGTCGCAGCACTTGGGAGGCAGAGGCAGG 

CGAATTTCTGAGTTOGAGGCCAGOCTCXSTCTACAfiAGTGAfiTTCOCAGGA 

TAACCAGGGCTACACAGAGAAACCCTGTCA<X;CTGTCTCAAAAAAAAAAA 

AAAAAAAAAAAAGAATATAGCATTTTTAAGGCATTI^ATGCACAAAATCA 

TACCACCTTGTTTTACAAAAGATATATATTAACTTGAAGGCTGGAAATGG 

TGGCACATGTCTTTAGTCCCAGTATTGGGAAGACACACOCAGA^GGATCT 

CTGAGTTCAAGACCAGCATGGTCTACATAGTGAATTGCATGTAAGTTTGT 

CCGTGTGTGT AACTTT<IAAAGCTCATT AT AG AATGGAAG TGTCT AGCCCAC 

CCCACTTACCAACAGTAAGGAATATTATGTTGGTGCXX3CTCATTTAATAC 

ATGGTGTACTCCCAAGGTAAATCATTTT<^TGTTTAGTOGCTCCTATTAT 

TTTTTCCATTA1X1AATTCAOTACAACTACTACCACCAATCACATTTAGCC 

ACTAGAAAAGCCATGTGATTTGCTCCA<^CATACAAC^ 

TAAACATCTTATCAGTACTACTCTCTCTTTCACTCACTCAATGCCTAGTC 

CCCTAA <jI ' l w lMU <^ACGATTACACCftGGTAAATTCCTACTTCAGGGT^AT 

GACCATCTTAAAAACTA<X^CCTAGCAATTCTCTTTGTATAAGAAATACT 

TCCCCGTATATACACAGAAAAACAAAGAACACTACTACAGCACTATTCAG 

ATGACAACTGACTAAAAGTCACCTAATlXXrTTATTTATGGGAGTTGATTA 

AATTAGTCATTACAAATCTGTAGGTCTGCAAGACTAACCAAGAGCTTCGT 

GAGGACAATAGGTAGGGCTACCCAGAGAAACCCTGTCACCCTGTCTCGAA 

AAAAAAAAAAAAAAAGGGAGGCACAGAGAAAAAACAACAGGCCOGGGGTA 

CCTGTACATCTATGTAAGCGTAGGTACATGCACATAAAAGTGACTACAAG 

AGAACATAAACAGAGAGCGCOGATGAGAAGAGGATGGGATTTTTCATTTA 

ATTTGCGTGTATGAGAGCAOCTATATGTK3CATGTTATCCGCAOCAAAGTG 

TGTAGGGTACATTATCTCAGTGTG^ 

TCAATCACTCCGCTC Xr r rr in^^ 

TACTGGCT^SGACTAGAACTCACTATGCAAAOCAGGCTQGGCTTGAATTCT 

C^GAGAGGCTCTTG AGTGCTGGAATT ATATGCATGTGCXXX^ACACjMSCC 

CACCTCATTTTGGGGGGTAGGATCTTTCACTGAAeCTCi^^ 

GGTTAGACOGGACTGGCCAGTAACSTTOCAGGACCTCTCTroT 

CTTCAGCACTGTGATCACAGGCTCACAAGCACACCTGGACTTTTACTTGA 

GTCCTGGAGATCTAAACTCAGCTCTGCATGGCTGTGCAGA^XSGAATTAAA 

CTGAGCCAGCTKrTCTCAGTATCAAGAGAGAACATAGGAACTGTAAGATTC 

TGACAGTACTCTAGGGCTTACAGAACA<XX*ACAC^ 

TCAGTTAATAAAAGAATAAATACAAACAAAAAAACATGAGAAACATATAG 
AGGCAGAGACAGACAGACACACAeACACACACACACAC^ 
AGACACACA<y*GACAOGCACTTAGA<XX»^ 
GCX^CCTAGAAA<^<»rrACXnTXX^TGCA^ 

GGGATTTTTAACCACTTGTGGGAAATGC5SGTAC1CTCCTATTCTAGCACA 
GATTTGAGGAAAAAGTAGAOCAGA^SAGTCTGTOCTTOCACATATCCTXWA 
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AAGTCACTGACATGTCCAAGTTTTGATTTCTTCATAGGGACAATGAGAGA 
^CCC^GACTATCraCAG<^GCA^^ 

AGAAGTGCTTACAGCAGTGTGCTGCTAGAAGGTGCAACAGTCT^rcTTACA 

<^GGGCATTTAAATATGCAGGATGGATAAGTTTGCCAACTACAACTACAG 

AGGCTGGACAAGGTAGGACAGCTTCTTCACTGTCAAAGACGTTTGGGCAG 

T^roCTTCTATTTACCITAAAATCAAACTGTGACAGCTGTGGCATATATAG 

ATTTCTCCCAGAATGAAAACACATTAACTCACTTATGTCAATAATATGGA 

GTAAACACAAACATAGTCTATCTAGCTCAGCATGCAAGACATGTGAGGAA 

^* t ^*^^^*^^*' , '*^*'^^^^*^ , CCCTGTCCCTAAGGAAACCAATATATG 

TAAATGTAGTCTAAGCTGCAGGCAGTTCTTCAACTGCCTACCCCAGGCTG 

^'^^^^^^'''^^^^^'^^^^'^ACAGACTAGAAAGTATGATCAACCTCTXS 

AACACTGTGCTATAATGTTACCATCAATCTCACACACAAATTTCATAACA 

TTTTAAGTAAGTCTATGATGATTCTATGTTGTGTCCCAGTTATATAAGAT 

CCATAGGTCACAGGGTAGACATTCAAGGACACCAACATTTGGAATrTTGG 

GT*T^U"X"lU , GGTGTACTXWATATACTTGCTAGTX»CAGGTACCCATGOrcAT 

GTGTGTAGAAGT1XJGGCGTCTTTCTTCTATCACTGTCTACTTTATATTTT 

CTTTATTGTTTCATTTGATATGTATAGGTGTTTTGCCTGCATATATGTCT 

^'"''' > '^ , ^'^^^^^''^^*^^^*' > ^AATTCCCTGGGACTAGAGTTACAG 

GTGGTTGTGAGGCACCATTATGGGTACTGGGACTCAATCCTGGGTTC7CT 

GGAAGGGCAGCCAGTACTTTTAATCACTGAGCCATCTCTT'TAGCTTCCTT 

^^"'"^^'''^'''^^'^'^^''^^CTTCATTCCTTCATTCCTTCATTCAGAGG 

ATTGAGATACCTTCCTCAGTTAGGCTGGCTAGCCAATGGACTCTCSOGAAT 

GCAGGT^CTACTGGGTTCATTTTGAAAAATTACAGAACTATGTATTTTCT 
TCATAAATCTGAAACTCAGCATAACTGTCTCAGGCTAACATGGAATCCCT 
AAATATATATGAGGCACAACCTGACTTrACCAACTGTACTAl'GTAAATTT 
GCrTAGTATATTAGTCAACACTTAATGGAAAAACATCTGATAAAAACAACT 
^*^*^^^^^^*®®^**^^*^ACACTTGGGGA6GTGGATTCAAGGCAOTC 
ACTGGATTCTTGAATTTAAGTOCAGCCTAGGCTACATGAGATTCTOCTTC 
AAAAAATAAACAATTAAATTTATGGGGGAAAGAATGATGTATTTTGGTTT 
CAGAAATTCCATCCTATCATCCAAGGGAGATATTCTATAACAGOGAAGTT 
CCTCAGCTCACAGCAGTCAGTAGCATATAGACAATCCTGGCTCCAAGCCT 
ATGAAAACACAGCCTGTACTAAAGGT G 1 1» 1 It-Ll^'lUa W mAGTCAGAT 



ATTIX3GGAGGAATTAGGAGGTGTGGOCTTGGTCSGAAAAGGAGCATCACTA 



**^^'^'^*"®*'^' , ^***'^' , ^^'CCTGCCATCATCCCCAATATGTOCTCT , CT 
*"^^^^^' , ^' , ' , ®^'^"''^^'^^^TAlX^GCTCTTAGTTACTACTTOCACCA 
CCTACCOCTGCTATCTCTGCTCCAl'CATCATXSGACTCCTATTCTGGTOGA 
ACTGTTAGTCCAAAAAAG VI\JTTCTACAACTTGATTTGA'rocCAGA 

'^^***^^^^^^*®^^^***^^***^^TATACCAAG<^ATAOCATCTTGAAC 
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TCTAGGTGTCTCTCAATCCAATCA^ 
TAGTCATCCXXIAAATCAGTGTATCTCTCTCCT 

TCAAGGGTCAAAATATGTAGAAAGGAA<1AAAGATTCTCAAAGGTCAAGGA 

TCAGACCTTGGTGAGGATTGAGCACTGTCT 

GGGTCCACAATGTAAAAGAGAACTGACCTGAACAG^ 

TAACJUUVTGTCTCATACGTATTGAGTTTCTTATAAATAAATAAATAAATA 

AATAAATAAATAAGCAAGCAAGCAAGCAAGCACTTAAGAGCACTAGCTGC 

TTTCTTCCTGAAGACCTGGTTTCAATTACCCAGCACTTAT 

ATACCAATTGTAACTCCAGTTTGATGATATCCAACATCTTCT 

TCAGACACCAAGCACCAAGCATGTAATGGTAT 

ACCCATACAAACX^AATTTTTAAAAAAATATTCGAGCGGGG^ 

ACGCCTTTAATCCOVGCACTCG<X^GACAGAG 

TCGAGGCCAGCGTGGTCTACAGAGTGAGTTCCAGC^ 

CAGAGAAACCCTGTCTCGAAAAACCAAAAAAAAAAA^ 

AGTCATTTT AGGGCTGG AG AGATGGCTCAGGGGTT AAG AGCACTG ACTGT 

TCTTCCAGAGGTCCTTAGTTCAATATCCAGCAAGCACATGGTGGCl^ 

GCCATTTGTAATGGGGATCCAATATCCCATT^ 

CTATAGTGTAAATAAAATAAAGAAATCATATAAATAAAATAAATAAATCT 

TTTTAAAAATATTAATTAACCC^GGCrrcAACCT 

ACATTAGGCTCTTTAATGCGGGTGTTATAGGTCTGAA 

AT AAT ATTCTTCTG AAG AATGTGCCCTGGTCAATCACCATX1ACCACACCT 
GCCAACAGGTCCTTCATAAAATACTTGGTATATGTIXjAATGTTCCATAAA 
ATTATGGAGCTAGAAAAGGTAGTGAGCTAGAAGGATATTAAAGATATAAA 
CX^TTGCCCX^GTGGTCCTCACATTTGTCT 

CTGTTTTTATTTAGAATTTCAATATATAAAAGACAAATATGAAATAGTCC 
GGAAGCAAATTAAGCTACAGCTTGCAGCAAAGCCAGATAGAATGCA<^TT 
AAACTAACACAGTACCTTTCTCTTATGTTTTAGATGCT 
ACAACCCCAGCTCCCCTTGAACTCT^ 

CATGCTGCTAGGGTTAAAAGTATGTGOGACCACACACWSTrTTGAAGTTT 

AGAGCACTTAAATGATCTATTCAGCAACTCAGGCAGGATTT 

GTAAATTATCTTATGAATCCTTTTTGGTTTTOCTTTTATTCATTTCATTC 

ATCSCACCTTACATGAACTATCTATTGCTAGGCTGTCTCTATACTQGATGC 

TCAGCAOVTCACX^lACATGOeGATTCTTCTAC^ 

GAGAAAACCACACAACCTAAGACAGTAGGGAGCTGGTGC^^ 

ACXXXrPGGCrgrCCTAGAACTCJ^ 

CAGAAATCCGOCTGGCTCTCCCTGGCAAGT G CTg G GATrAAAGGGGTGTG 
CCACCACX^CGGG CT Xn \Xt w l X^ X Xr i X S ATTTTTAAATACAJ^^ 
CTAGCAATGTAACTCAGTAGTAAAATGCCTGGCCA<3CAT^ 
CAGACT<XUlCOCTGA<SCACCACAACA OT 
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Ai^i-i ATGTGCATGAGTGTTin\^^ 

CCTGGTGCCTGTGAAGGTTAGAAGGCAATGGAGCT^^ 

CTAC<IATGTGGAAATGGAGCTATGGAGAGTTGTAAACTACC^ 

CTAGGAATTGAATCAGGGCACTCCTCTGCAAGAACAAGAAAGGC^ 

CAGCTAAAATATTACTACAAACCCACACCACAAAATTTTAAATTCATA 

CATTATCACCTTAGTTCTAGATAGAGAATGTGCTTGGCATTX^AAGTACT 

AAAAAGGTTTTGGGGTGGATCTTTTATATTATCr^ 

AATTAATACTCAAATATGTTATAAGTTAAGGTTTTT A T w l ' l * l M X\jl UU , 'l* i < CA 
TTTCTCTATTTTGTCTATGTAGCTCTGC^^ 

TT^^CTGGCCTCAAACTCAGAGAGACCTGAACGGCCCT<3CCTCOUUlGAG 

CTGGGACTAACCATGCCCAACAGTAGGTAGCTTTAATACCTAACCAGTGT 

ATTAGTTCATGCTCTCAATTAACCAACATTCTCTACAT 

ATGCCTATTTAATCAAATACACAGTCTAAGTAAACTCTAAGTACAACTCC 

TTGGCTCATATTCTTACAATGGCTATGGCTAGCTAATTCAAAGGCCAGTC 

ACATAAAAGGGTCTCTATGAATTCTGATTAACAAATGCAGTTAAATAGAT 

GAATTCCTAAAAAGTAGTATCATAATAATATCATATTTAGTTTTTGT^CT 

TCCATTATAGTTTGAGGTGCCTCCTCCCATAATGCAAGGTATATTTCAAA 

TAATAGATATATACATGGTTAACACATGGCAAATGCCATTTTAAATGCTT 

AGCACAGCCTGCTCTTTGGCTCCATTAAGTGAAACTCTTAAGTTCTCAGT 

TAAAATAATTGTTGGAGAGCTATAGGAGCAATGGGTGGAGAACTAGTCTT 

^^^^TTTGTCCTTTGCCTCCTTGCGTACTAAGTAGTCCCTCCCTCACTAT 

GTGGCATTCCAGCAGACTACCACCAAGAGAAGAACAGAAAAGTGTTCATT 

TCTTTCTAAAGTAAAGAAATAAGGGGCCAGTGAGATACCTCAGCAGGTCA 

AAGCCATTTGCCTAGAAACCAAAGTTCAATCCTTGGAAGCCCTGTAAAGG 

TGGAATTAGAAAACAGACTCCACAAAACTGTCCTCTAACCTCCACTCGGG 

CACACATGTGCCAACCCCTCCATTCTCCCTCCCCCACATACAAAGTAACA 

ATAAACTTTCAGAAAATraAAGTTGCTACGCATGGTGATTGATGAATGTC 

TTTAATTCTAGCTCT'lXaGGAAGCAGAAGTGGGTGGAT CT 

AGACCAACCTX3GTCTATATAGTGTGTTCCAGGCATCCAGGACTACACACA 

CACACACAAAATTACGTGAAGGAAGTAGAATGTTTGAAGGAAAGLAA 

GGAAATGGGC^TGGAGAGAGACCTCAGCAATTAAGAAAAGGTCTTGCACC 

G*^CGTGGTGGTGCATGCCTTTAATCCCAGCACTCGGGAGGCAGAGGCAG 

GCGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTTOCAGG^ 

CAGCCAGGGCTACACAGAGAAACCCAGTCTCGAAAAAACCAAAACCAAAA 

ACAGAAAACCAGTAIX^TAGGTCAGGCAATTGGATCGAGACAGGACACTC 

AAGATAGCTAGCCTGTGCAATATAGAAAGAAGTCTCATGGAAGAGAGAGG 

GAAGGGAAGGAGGGGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 

GAGAGAGAGAGAGAATGAGAGCGAGAGAGCGAGCGCACCTCAGTTGATAC 

AAGATTGGGGCCCTGAGTTCCATCGCCAGCATCCCATAAATTGGGTGTAG 

CAGCACACACCTX7TATOCCAGCAGAGAGGCAAAAGACAAGTTCAAAGTCC 
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Yj^j^XGGAAAAAGTCSTGAGATCAGCCTGGAGACCTGGTGTGTGGCAGTGG 
GGTG AGGGGTGTCATCAAGG AG AAGGCTT AGTAAGT AAAGG ACCTGCGTT 

CAAACTTGTAAGCCAAGTGTTGGACTGGACAGAGACAGGCAAATGTTTGA 
GCTIxrCAGGTTCAGTAAGAGACCCTATCTCAAAAAATCTGATGGAGAGTAA 
CACTGGAAGAACTCAGAGTGAGTCACACATGCACACACAGGTGAATGTGT 

atacaaagggggcagggagggagaatgagaggagactgggagatatctgt 

AGTTCATGTCT<?rAATTCTAGCACTTCAGAGGCAGCTGGAGCTACACAGC 
^^qj^CCGCGTCTCAAAAACAAACCCAAGCCTGACAGTGGTGAGGTACACC 
TTTAAGCCCAGAGGCAGGAGAATCTCTGAGTTCAAGGGCAGOCTGAGTGA 
GTTCCAGGACAAOCAGGGCTGCACAAAGAAACACTGTCTTGAAAAAAACC 

aaaaccaaccaaac^aaaaagaatcaaaaac^^ 

AAGCAAACAAGGGAGAAGGTATAAAATOCTTAGGAGAGTCTTCCTTTAGT 

CTCCA|rcCrTTGGGTACTCCTTCG-CCACACAAAGCCACTACTAGCAATTT 

CTTACATAAGCTGCTGTTTTAGACACAGGTTTT'TTTTTTT'TTTAAATATA 

GTAACATATTCATGTGTAGCTCATTTTTCTAGTGAGTGGTTGGTGCTTCT 

TTTAACAGTTTAAAGGACCTCTATGTTTAAAGGCGATTGG<H3CTTG'I ! CTG 

GAGTATGGGTTGTATTTTCCCAATTTGTGAGTTTTACCCAACCTATTGCC 

TATTACCTATGGCCATTTATTCTTGTCGATAAGTAGTTTCCAATTGTATG 

ACTATGGTCACAGTKSTTCCATGGACTCTTCTGCCGCTAGACAtSCGGCTte 

GrcTCAATTTGAGATGGTTACAAGGGTGATTGGCTCTGCTGCCTGGGTGC 

TCXXaTTAAAGGCCri^CCWX*^^ 

j^<T<^GGTTTTATTGTGTTGCTCAGGCTGATCTCACTCTGCTGGCCTCAA 
«?rATCCTCCCATOKX»TAC^^ 

TTT1TTTAAATTAAGTTTTCTTTCCAAAATATAGATTCAGAAATGTGAGA 
TTTTCACAAAGTGAACCTGCTCACTTGCCTGGCTCTT1GAATCTGCATTGT 

(XK^PCXXGCCCATCCCTTTW^^ 

TfCOCATTCCTTAACTATJWCiCTGTCUU*l W »l^TT G GGTSTOAJ>CTTCCTTC 
QQCTGAGAATCACCTWnTCCGGGCACATCAGGTCAGTGjfiffiGG"ISGTTTfGC 

AG AGAGTITTAAa^AGA«3^<»C^ 

^C£TOATGTTCTGTCATCCTGGACTGGGTAGGAAGAGGAAA<3TAAGAAGC 
AAACGGCACCCGCACCTCTCTCSTCTGCTTOCTCGCQGACACAAAGTGACC 

AW XKXnCCCACTCCTG<XCCX^^ 

CCjjVCCACTCTGAGACrGTGCCTACTAACCGTGACCCAAAATAAATGTTT 

OUTim. X T AAGGTTGCCTTTGTTAGCTCCTTT AATAG AGGGGTACGACAT 

GTAACTGCCACAGGC^SGCATCGCTG^CAGCC^CTCCCACTGACCGTCTG 

AGAACCACACTCAGCTGTAGGCACAGCTCTCATAGCTGTGTGGGOGTAGC 

"I^TGTCTACTCGGTCATTCCOCTGCTGCCGAGCATTTATTGTtTTCAGTT 

CCTCGCTCATGGGTAGCACTGTTAIOAACAT'CCTAGTACAAA 

ij^^C^CGCCeC^TSCATTTTTGCl^GAGAAAATGGCCAAGGATAAAATGCTA 
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GGGCCAAGGGAAGAATATTTCACCATTAAGAGACACTGGTCAGGAC1XK3A 

AAGATGGCCCAGTGGTTAAGAGCACTGACTACTCTTCCAGAGGTCCTGAG 
TTX^TTCTCAGCAACCACAT^^ 

ATGAAATAAATAAATAAATATCTGAGAGAGACAGACAGACAGACACTX*GC 
TAGTCATCTCACAATGTTCTCATGTTTAAAATATGATACCATTTCTATAA 
AGCAGAAACACAGGAAAAATAAAATC-I^TX^ATTAT^ 

ATTAACTTGATTAGTGAAGTTAGCAGCTACACTGGGCAGGGGTTXK3GAG 

GGGGTACTCTGAAGTGCTGGTATTTCTGGT^^ 

TTTTTTATCTTATTTATATTAC^^ 

CCATGTGTATATATTGTGCTTGAATTACAGCTAAGTAATTATTTCTGAGG 

GGCTTTAGACTACTGAAGATTGGGCCCAATGAGCCCCACCCXAAGTAGTC 

^^^^^^^^^*CTTGGAAGTACTTGAGAGCAAAGATTCAAGTCACATt^ 
CCCCAAACCCTCAGCAGCCACCACXXTT^ 

TCATCCTGGAACATCTTGCCATC^ 

TCTGGTAGAGCTGGGTTTCTGTGCTTCTATTCAACCATGTACJ^ 

TGTGCCACCTGCCATGTGCCAAGCCTGTGCCAGTCCCTGTGAGCGAGCAG 

CCCACCCGGTGAGTTATCATGTGAGGAGCTATGAGGAGCAGGAAGGGGCC 

CGGATGACTTCAGCAGACAGTATGAAGCAAGCACTGTGOGATTTATCCTC 

CCTGGCCACATGCCCACAGATGGTGTCTGAGACACTAGCGTTTAATATTT 

GAATTCTCCACATTCTAGCCTAGACATTTTGGTTGCJAGAAGAAAATIX»A 

CTCCAGTTGTATCCTGGAATGAAATTTATT^^ 

CTCCCAGAGAAAATACGATATTCAGGCACAAAAAGAAATGGGGACTGAGG 

ATCTGAAGTTCAAGGTCATCTGTAATGAGATTGAAGTCAGTTTGGGCTAC 

ATGGGACCTOGTCTAGGGGGAATGGGGAAGAGAAGGGAAGGGATCGAGAT 
AGGGAT 
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CAATGTGCTCTGACGATTAATGG^ 

GAAAAGATGTCATGGTTCAGGAGATTGCTAGTC^^ 

CACTGAAAGGGAGGAAATAGCCTGGAAGAGATAAAGAGACA<^ 

TAGGAAGCTTAAAATTTAAATTTTGTTGGAAGTACTKnTAGGAATACTAG 

CAGAGGCC^GATGAATGTATGGTTAACTTATAGCAAAGGAAAAGATTGTT 

AATGGTGAGGTTAGGAATGCAGGGTGACACCAAGCTGTAATGTCAGCATT 

AGCGAGATAGAAGCAGGTGTTTAAGGCCATTCTCTGCTACTTAOCAAGTT 

GAGGCCAATCTGGACXIACATGAGACCTTTTTTCAAAAATAAATCTCCTTA 

AACAAAAGAGGCTGGO* TGATAGATTCTTCAAG ATGTT AATCT AAA 

TAA^TGGAAGACCAAGGATGGCATGCTAATATCCTCAGTGTCTGAAGAAG 

GACT ATGT AGTGTTGGCTGCTGACTCTGAAGT AAGTOCTCATT ACTGAGA 

GATAGTGTATCTTAGAGCCTGGCAGATGGGA1XX5AAGTGAGGAAGCAAGT 

AGCACCTTTGTATATTATGTTCTAAGTAGCCAGAGATACTTGACAOAAAA 

CAAAGTTGAGAAAATGTATCTTCTAGAAAATACAGACATGGA 

CTTTCTATAAAAGAGGTATTAAACATTAACCTGAAAAAAAAGTTAGCAAA 

TTGGGCTTTGGCAAATGAATATAGT^ 
TGTATATGACTGTTTGGCTTGTTGTACCAT^^ 

AAGTCAGCTGGAGTTACAGATGGTGTGAGTTGCCATGTGGGTGCTGAGAG 
ATK3AACCTAGGTCCTCTGGAAGAGCAGTTAGTGCTCTTAAGCACTGW3CC 
ATC*XXl"XXZ^AGTTOC'l w lV w IXSTAGAATTTTCATTAA 

TATAAATGATAAAACCATGAGAAGATAGACOGGCACTAGAATTAGTGGAG 
TCAAAATGTTAATGATATGTCAGATAOGOCTTATATGAGGAAGTTGCAAA 
ATTATGAAAATCCAGGCACTCCACTGAGTTAGAAATCTAGGCTCTGATGC 
ATACTGCTATGGTAAGCrTAGCAAGTGGCCATTGAGTG 

GGATGGGTC g l*l l C'X\>U'i>j TTGTXSGAGCACACACACTGCTGTCTTCroCATT 
GCAGTTTCACCTC^ATTTCCTTGGAACTACTTAGCTTTGCAACTA<3GOC»T 
TAAAAAAAACTTTATATTTAT Win^ 

ATTTTATGAGACATAGTCTCACTCTCTAAOCTAQGCTGGC?CTGGAACTC5C 
CTAGGTAACTTGAGCTGGTGATTCTCTTGCCATAGOCTTCTAAAATTTTA 
GArrrGCAGGCATAAGCCAGACCACTCCTX^ 
ACATCaJVGTGTAA CnnnW l V ACATAACTAA^ 

TATlXaTTAATCO em T^ 

ATATATAACCACAGACCTTTCCACAGGCAt^ 

GACTCOX^GACGTCTTATATATGAATGAATCCCT 

CTGATTTCCACGGGTTCATAGCTCAGTTATCCCATTTAAACTAGTCTAAG 
TCATGCCATGAGGCTACATAOOOCTCCTTCAGTTTCAGGOGACTGTCTTC 
TCAGTTGTSTAATCTCCTAUXAr^^ 
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tgcgtcatagtccgtctgtc^^ 

ACTCTACTCTAGAAGTCCTCTCTGTGCT^ 

CCGTCTAAGCTAATAGGCAACAGCATTGTACAGACAGGTGATG 
ACATCGCACAGGAGATTCTCCCTAC^CAGATACTO 
ATGCAACCGTCCAGGCGTGTTCTC^ 
AGTCTGATGAATTTCTTTGTCTTTACA^ 

AAATTTTG ACT AACA T I "1 "I W 1 " 1 ^1 W 1 W A'ATTTAAATT ACAGACTAACTGGCTC 
TTCTGG ATTTGT AACAG A TGG ACCTGGGAATT AT AAAT AT AAGACG AAGT 
GCACATGGCTCATTGAAGGACAGTAAGTTATAATGGCT^ 

ATTTATTATAAGAGCACAGTATAGCACAAAATACTTCCATGTGTGTTATT 
GCTATTTCITCAGACAGGACCTTTCTC 

GAATTTTGCTATGCTACCTCTXSCTTCCCAA 

GACX^CCATGCCCTGCTGCTAAAATACC^ 

GCTCAGTGGGTAAAGTGCTTGCTGAACAAGTCTGGT^ 

CCTGGCTCCtZACAGTGGAAGAGTGACTCCT^ 

CACCOTTGTGCATGCACGC^^ 

AGGAAATTTTCTTTTTTGGGTGATAGGGATTGAACCTATC 
GCAAGTGCTCTATTGTTAAATAATTCCTTTAAT^ 
TTAGGTTOCAAGTTGACTTAATCOT 
TTTGCATATTTCTAATAGTTTAAAAAAC^ 

TTTGCTTAAATCTTTATATAATAATGCTATTATATCATTTTTCTAAATAT 
TGATTTTATTATCAGCAAAACAGTAAATGAGCCATCAGAATAACCACTX^ 

TTTCCTT-rinnxnXKXTGTCATTT 

AAAACT1TCAAATTTXXAAGTAACTCTTGTTO 

TCAACCXIAAGAAATATTATrTACTAACTCATTTAAAAGCAAC^ 

CCCACTACATGTTAGCAGAAAAACCTA^ 

A<^CTACTAAGCACTACATGGCATGGC^^ 

TGCTGGGATT ATAGACATTACCAACACACCGATT1X>3GGGGTTGG 
TCKXIATCAGTCCAGAGTTGCATGGATGCT 

AGCTATATCOCTGGTCATAAATGTCATAAOGAAAAAAATTCCTTATATTT 

AAAGAAATTTTAAGAATTGCATTGTTTAAGATTTCAC^ 

ATCTCG<^TC T1V1T1X^ 

TGTAAACAAACTOAAATATGAATGGGACAGTTCCAGAT^ 
TTAAATATTTO^GAAAAATTGATAGGTTTATCT 

AGAGATTTTAGTAAAATTTGAAAATGGAGCTGGGAGGTCTGAGGTAGTC^ 
TCTAAAGCTGCCAGTTGTAGAG<XrrC?r^^ 

TACTGATACACTTK3TTGAAATTGCCCAGGCTTCA 
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GCTGTTACTGTGACTCTGGGCAGGGCTTGT^ 

CTCAGTCAGAGTTGAT ACAT AGTTTGCTGAGG ACGTGGCTTTTTGGT ACA 

GTGClK3TGAAAAGGCAGAGAAGCA<KrTAAACTTAGAAA^ 

AAAGTGATGTGTT ATGAAATCTT ACGT AAG ATG AAT AAAG AAAGAAG TGG 

GGACACTGAGGGCTCCTG-rTT^ 

TCTTTXSAAGCKXXCT^^ 

CATTTTTG AT ATTCCCTT ACACAT AGCAAAT ACT AAC^ 
AATGCAGGAAAGCTGTTTATATTTATATATATTTATATTGTATATTTTTC 
TCCTT AT AAATTCTTT AAAAGTCTGTTTT AGT AGTT AATGTT ATGATT AT 
TATAAATTACTTAATTATTTTTCTAG<X 

CTTCAACCATTTTGCTACAGAATGTAGCTGGGACCATTTATATGTTTATG 

ATGGGGACTCAATCTACGCACCTCTGATTGCTGCCTTTAGGTAAGGOCTG 

CTOCATTTCATCTCAGGAAGTAAGTKSTGT^ 

GCATTTACTTTATTCTGCAG^CAC^ 

CTGGTGJVGCTACAGTTCACTTGGTTTTTATGTACT^ 

T AT ACT AGT AT<TT AGCXI1ACGCTT AGTCTTG AACTTCTGGTTCTCCTGCCT 
OCACXrrra^GTGCTAGGACTATA^ 

TTTCACATTCTTG AACTGTGAAGTTTTG AT AACACT ATT AAATTT ACCTG 
CTATTTGTGATTTTGTTAAAGTTTG^ 

AT AAT ATTTTTGTG ACAAATTT AAATCAG AAACX1AT ACC'i"l"rCTTG TTCT 
TGT ATGT ATTTCATTCCAT AGGCC CTT AGGAAT AACTrTTTTTCAAT AGT A 
TATAGrKnOTCAGTTTGTATATATCTATTATO 
TCrTCGAAGACTATTTATAAATTGGACAATG^ 

AATTTGCTAG* l *T'l l 'l ■rGTAAATCCCTCCCCAATGCATCTGTATTAGT 

GATTTAATAAAATAATGCAATTTTGTCA<^ 

TTTGCTATTTTATTTTAAC^^ 

GTCTATGATAlxnCTCOCmne^ 

CTCJWCMOCTATOTS^^ 

ATCATTATA1TCCACCTTGTTA 

ACTGG^AAOCAAGAGCAOCAAGACCTCTTCTTCTl?CTTCTTCTTTTTTCA 
TTTTOGGTTTTTCAAGACAGGGTTTCTCIX^AT 

AACTCACTCTGTAGACCAGGCTGGOCTOGAACTCAGAAATCTGOCW 
CTGCCTCCXaUVATGCTGGGATTA^^ 

AAAAGATTTTCTTACTAAAATATATTTCTAAATTAATTitfSTTGGAATCTG 
GTT<^TA Cnnvnunn TCAAACAAAA<X^«^ 
^GAGACATTGACACTAGACACTGOTTATGJ^ 
GAAATTATTCCACCCTTGTAtt^^ 

AAGACTTTTTAAAAGCAAGAATTGTATATAACAGACAG^ 
CTATTTAGArcmA^ 
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GGGCATGTTTT^ 

TAGCWTCTGGGAATCA^CCIXOT 

GCTGAGCCATCTCTCCAGTCCTC 
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aggcaagaaagagccagggagcctccagacagaccattatsaaattccaca 

gtcagcacaatagggagaaca<5taaatcttacattaaaagaagggca<x5g 

cctggtagcaaaaggttttaatttaa<;cacttga<3a^ 

atctctctgattgggggttggggttaatggt^aatg<xiatgacacgctg^ 

tcagagttagccttctcccctaaaaaaattttaaattcatt^ 

gacacagttaatcatagacattgtatct<tagacacctcaacatactgcag 

actgcagcaccaocccactgctgaggctgtcgttca<5ttggta<3aaggca 

tgctcagcattcgcgaagcacca<3acttcatcctta<x:actacat 

tgggtgtggtcatgcacacttataac^tc 

ggatgatgagaacttgaggatcattctcagttacata*^ 

taagcaggggtaca<x3a(3gcctgtctcaaacaaacagacaaacagacaaa 

caaacaaacttcat^aaactcttgaagtactag^ 

attgtaggtatatgtcatcatgcctgttgta<3aatga« 

ccataggcttatagatttgaatcttggtgtctgtctatgtcatgtca 

ctgcaca7vaagcccacactaggcccacacattctctctc3 i 1 3 ctgctgcatg 

tgggttagatgtgagctctcagctgctgctc 

tgccaggctccagccatgacggtca<xx;actaactctctgaaactgtaac 

caagttcccaatgaaatgctttctttc^ 

ctgcttc^cagcaataacacggtgactaagatac^ 

GCACCCCACCATTATTTACCATAAAGTAAAC7VATACACA<3TTGGATAACA 
TGATACTGAA<5TTATTTTCCTGTTTCCTGATGTAACCX:^ 

<3attaagccttaaatagcaa<x:tgtc 

aggccaatgtctgctttaccaaattctgtt^ 

c^c<:tcgactcctgtcatggca.t^ 

ggtcacaaccttttagtacacagattgcaactct^ 

gcttggctaattaaagcaagcta<3a<3tttgt^ 

ggggaggtggtatttacaaaattttgtaaataaact^^ 

atgtatataaatttgatgtcgctccttttaaatc^ 

cccaca<3aatcatctgtttgattogaaagattc 

ctgctgaac<ti\3aaatgattcataatgatgtgtctc 

tcacctacggtttttgttttagttc 

atgtatgtgtgtgtgtgtgtatttatgtgaatgtatagctcatgtatgtg 

gtgctcaaggacacctgaagaa<x3gctctgga 

tgtgagtgctaggaaagaaatxttgggtao^^ 

tctaaccactgagaaatcctgcca<x:c<x:t^^ 

acaaaaccaacactagttacataagtatctctctctctttctttctctct 

TTCTTTCTCTCTCTCTTTCTCTCTCTCTCTCTCTG 

CACACACACACACACACACACACACAAA<MATCCATAATAGTTCTTCTGT 
ATCCCGGTTAAATATAA<3TTCTTAX5GGGCTAGA<3AGATGGCT^ 
AAGAGTGCTTGTTGTTCTTXX1AGA<MA<XXIAA 
ACATCAGGCAGCTC^CAGCTACXrCATATCTCC^ 

tcaatgcctatggcc<^tgcaa<x:a<^ 

TCCATATATATAGCTAAAACTAATAAAAATAAATCTTCAAAAAATT/^ATT 

ctggttgaactgaaaaagatcacctaacatttagaaaaagcagtt^ 

GTGAATAGGACATAAATCA1KK3TATCAAATATTCTGTTC 
AACTAGAAAAAGCATK3TGTTTCAAATAAC 
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GGCAACCCCAACATCTGTCAGTACCTTGCAAACCAACACAATAAATTTGA 

TTTTATTTAAATCGTAGTTATTTTTCATGCTAGTAGTTTTGAAACACAAT 

AAATTTGATTTTATTTAAATCGTAGTTATTTTTCATGCTAGTAGTTTTCA 

AACCAAGATCTAGATTTTGTATAGCCACATAAATACACATTAGAATTGCA 

AACTGATACGAGCTTCATCTTCATCAGTCTCTCTTCATGAAAAGCAGTTA 

CAG<X3ACTGAGACATGACTCAGCAGTTACGGCATGGGCTGTTCTTCCATA 

CGACATGGATTCAATTCTCAGTGCCCAAATGTTGGCTCACAACCATTTGT 

AACTCTGGTCCCAGGGGATCTGACACTCTTCTTGGCTTCTATGGCCACTG 

TATTCATACGGTACACAGACACATATGCAGGCAAAACTCAACAAAAAAAA 

TAAGGTTTAAAAAAAAGAATTAGAACTTAAAGGCACTTCATTCCGTCAGC 

ACTAAATCAGCCTCTCTGGAGTCTTCCCACTTCATGAGAAAATCGTCAGC 

TCTCCACTGCTGTCTGTGGCTGAGGAGCAGGACCTGGACAACGTTCAGAG 

ATTGTCAGTGCATCTCTTTTCTTCTTTGGTTTGCTGTCATCAGGTTCACT 

GTCACATTCCCTTTGTACCATCCTTCCTTTAACAGCCTTTTGAAAATGCA 

GAAATGTTGGATGCTGCCTTCAGTTCACACAGGCTGTCTTTTTAGCTCCT 

CATCTATCTATGCTTAATTTGTTAGTGGTGCTCACCCATGTATGTGTTTA 

TGTCATGAAGCCACAAGATGAGCCTTGATTGAGTCTTGCTGTCAGTGTGG 

ATCACAGAAATGACACCCTATCATCTTTGCTTCCTGCTTGTTAGAAGTCA 

TTGATTCTGCTTATACTCAAGGCCCACAGTATTATACiaX3GGTGTGAACC 

CCAGGAAGCAGGGAGGTGGGGGGTGTCATGGATACTACTCAGATATCTCA 

CTGTTGTGATATTTCATCAGTTCTCATTG<3TCCTATCTTTAAAATCTGCC 

CTACATCTAGAGCTGGCTGTGGTGGTGTGTGTGGTGGCATCAGTATCAGA 

ACTTG<3ATTACAGAGGCAGGAAGATTGTGATTTTTGAGGCCAGAATAGGT 

GCATACAAAGATCCTGTCTGCAAAAGAAACAAATGTGCAAATAATTATAA 

CTACTTTACTAATAGCCTAACTAATAACCACTGCTAGTGCTGTGTCCACG 

AAAAGGTGAAGTAAACTGTGAAAATGACTTCCCCTTCTGTGTGACAGACG 

CCGTCATGTGATTTTACTTGTGTCTCATCATTGTTTTTCCTTCTGTTTCC 

ATGTGTGAATGTTCACATGTGGAAGCCAGAAGTCAGTGTTGAGTGTCTTC 

ATAATTGATCTCTATTCTCTTTGTTTTGAGACAGGGTTTTGAGACTAAGC 

CCAGTGCTCAGTGATTCATCCAGTAAACTGTAGGGAGCTTCCTGTCTCTG 

CCTCCACAGTGTTGGGATTACAAGCATGATCCAAATTATGTGACAAGCGC 

TTTACTAACTTAGCCATGTCCTCAGCTCCCCACTCCCCT'TTTCTTTTCTT 

CTTCTTTTTTTTAGACTTACTTGTTTATTTTTATGAATGTC^IT , GCC'IK3CA 

TGCATACACACACACACACACACACACACACACACACACACCCCACATGC 

AAGCAATTCCAGAAGAGGGCATTGAATC<:CTGAAACTGGAGTTCCAGTTA 

ACTGTGAGCCTGTC^TCTGC<3TACTGGGAGCTAAATCCGG 

AGGTCAGCAAGGTCTTACCTGGGAGCCX3TCTCTTTAGCTCATGTGTTTCT 

CTCTT^AAGCAAGAAACCTAGGAATCATTTTGAAACTTCCTTCACAGCCT 

TTATCATAACTTCACGTCAATTTTTACCTACTCTTTCAACAAATACATGT 

TATATTTACTTATTTTTATGTTTAGCCTGCTATTGGTTTCTACTTAGCCT 

CTTGCAGTAGAGTTCTGTCAGATTTATGTTTCTATTGCTTTTAATTTAT^ 

TGTAAAGGTGAATGGGAAAATATTTAAAAATTACAGATCCCATCATTTAC 

TATATTCTTAAAAGCCATGG<I^AGCCAGGCTTGGTTGTGCATGCTTGTAC 

TCCCJVGGACTCTGACAACTCAGTAAGGAGGAGAGTGAATCAGAAAATAGC 

GCCAGCCTGTGCTGCTTAGCAAGAAACAGAAACAAGTACAATCACACACA 

TAGAAAATCCCCCATTAATACCATCCCATTAGATATAATGGTCCTGTATG 

ACCATTCAACCACTGTTTGTCCTCTGTACTGCAGTAACAGTCTTCTGCCC 

TTGCCCGTGAAGCACGTGCGCACCCCGCCTCCAAGTGCTTTTGCACTGGT 

GTCCTCCGTCTAGATGTCCTGTTACTATATGTAAGGACTGGTTTCTCCTC 

CTCTTTACAGTTCAATCTAATTGTCTCATGAAAAGATCTTTCCTGACCAT 



»SDOCIO: <WO _000S373A2 . 1. > 



WO 00/05373 



22 / S9 



PCT/US99/ 16484 



CTGGTTCAGACAGGTTCTCGCTGTTGTTGTTTTGTTTTTTGTTTTATAGT 

^TAAATTCCTTTCAGGAACTTTTGCTTATTTTAAAT^ 

ACGTGTGCTTGTTGTTGCTCATGCTCGTTGTTTGGGCTTACTTTACTATC 

AGCTCTGGATGTGGTTCACAGAAGGTGCTCAGGGGAGCACTCTCAGCCAC 

TCATCTCACACGGGTTATAGATATATGTATTGATGCTACGTTTGCTTGTG 

AGCCATGTTTTAAAGATTAGAATATCTTTTCTATGTGTACTCTATCAAAA 

CACATGTTAGGGCTTTATCTATTTTATACAGATATTGGTGTTCTTGCTTT 

ACTAATTTTCATGGAATTTGGGTGAATATTAGTATTTTAGATAGGAAGAC 

TTGTCTCAAAATGTAGCTCAGCTGGTTGAGTGCCTGCCTGCATGTAGAAA 

GCCCTGTATTCACTCTCCAGCACCTCAGAAGTGGGCCATGGTGCATATGC 

TGTCATCTCAGCACTCCGGAGGGAGAGAAAAGAGAATCTGGAGTTCAAGG 

TTATCCTTGGCTATATAACAAGTCCAAGATCAGCCTGGGCTACATGGCAT 

CCTGCCTCAAAATCAAACACCAAATCAAAAAGCTCACATCTTCATCCAAA 

AGAAGGTAGAGAGAATACACTGGGAAAGTCTTTGAAACCTCAAAGCTAAC 

TCCAAGTGACAGTGACACCTCCTTAGCAGGGCCATAAATTCTAATCCTTC 

CCCAAAGCCCACCAACTGGAGACCAAGTATTCAAAGATAAGAATCTATGC 

AGTCCATTCTCCTTCAAACTACCACAGTAGGTTTTCTTAAAAAAAGAAAA 

AAGAATATTTTAATTGATTGTGATTATTCAGTATTATTpATGAATAATCA 

TGAACTACATGGCAGGACTATAAACTATTATTTTTTTTAAAGATTTATTT 

ATTTATTTTATGTATGTGAGTACACTGTAGCTGTCTTCAGACACACCAGA 

AGAGAGCATCAAATCCCATTACAGATGGTTGTGAGCCACCAAGTGGTTGC 

TGGGAATTGAACTCAGGACCTCTGGAAGAACAGTCAGTTCTCTTAACCAC 

TAAGCCATCTCTCCAGCCCCTATAAACTATTATTATATTTATAAAATATA 

AATCCGTGAGTCTGTGCACCCCTGTGTGCACATGGATGGGACATCTTTGA 

ACTGGATTATATCATACTTAGAAGAATACAA<3ATACTCTGTTTTGTCATT 

TGGGTGAAAATATGGTCTGTTTATTTTGCAGGTATGACCT<3ACTTCTAGG 

GAATGGCTTCCACTAAACCATTCTGTGAACAGTGTGGTTGTAAGATATGG 

TCATTCTTTGGCATTACATAAGGTAAACTATCTCAACTCTTCACCAAGCA 

AGAAGTTCAACTCTTCCTGTTGCTTTATGTCATTGAATACTATCGAGCTT 

TGGTTTTAGTTGGTATAAGGTTTGTTTTGATGTCATGGAGGTATATAATT 

CACCAAGTTGTCACCAAGTTGTAATTGGAAATTGAA<3TTAGAACGATTTT 

AATCCATGGTGTCTTGCATTTGGATACTCTGATCACAGTTAACAATGAAG 

A^AAATAGTGTCA<5CAAGCCTATGCCCATTATCAAGTCTAGCATACTGC 

ATGCGTGTGACTGAGTAGCCATTGTTATCTGCTTGTTTTGAGGGTATATT 

GTAGAATGAGGCAACTGTATTTTCCACAC<1ATTTTGH3TTCTGTAACACGT 

TTCATGTAGAGAAGGTGATTTAGAGAGGGGAAGAATGTGATTGTATTGGT 

•TX^TTCTTTCTCTATGCTATTCCTAtX^AAGTCACCGAAGAGCTCATGTTA 

ctcacact^ttaa<x:tgggatcacaatgagattgtgaac<:actcattgt 

TK3TTTTCCAATATAATTTTTAAAAAGATGTATTTATTTTTATTTTATGTG 
TGTGGGTGTTTTGCCTGCATGTATGCCTC 

tcagaagaggatggcatcagaactggtggc!^tta<k;tgccatgtgggta 
ctaggaActaaacccgggtcctctgcaagagcagcaagtgttcataaact 

CTCTCTCCAGCCCTAGAGTTGATTTCTTAATGGTTTTAAAAATCCTGTTT 
ACATCTTTCTTATAGGATAAAATCTACATGTATGGAGGAAAAATTGATTC 

aacagggaacgtgaccaatgagctoagagtatttcatattcataatgaat 
catgggtattgttaactgcgaaagctaaggatcagtatgcagtggttgga 

CACTCAGCACACATTGTTACACT^ 

CATCTTCGGTCATTCCCCACTCTA1XX5ATATATAAGCGTTGTGCAGGAAT 

atgacttgggtatctattttttccag^ 

GAGAACGCCTGCCCTTATTATTAGGACAC<MTAACAAAATTCA13CATGAT 
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C^ATCCAGTACCTTGTCTTGAAATAGTATCAGTAGATAACTGCTrA^^ 
T^A^TTGTTGAAGTCCCTGTGCAACAGCTG^^IJ^^I??^ 

AAAAGTCCAATTGTAGGTCCAAGCTGGCAGCTGTATATTGCATTa^^^»^ 
AGCTGAC^AAATTTG<^ATATTTATTTCA^A^ 

aagtcaacx:gctcacagtcaacgtttgcaccctca^™gt^ 

AGGGGGAACTXaAGGAGTCCAGCATGGTCCTGGTTGGG^CArA^ 



STAG 
XTTC 
'AAC 
lATA 
IAGA 



^E^ A 5 A ^ ATACC CAACTAGTTTTTC TC ^ 

rTGTCTC 
3AGTAAC 
rcCAATA 



c 



^tttaatc^agaatttgttt^^ 



cc ^aaaatgcatc^aaa Ga ^^ 



^^^S^^^S^^^CTCTAC^T^cCAGTGCATC 

&GCACACTGA 
rCTTCTTTAT 
TAAGGTGTTT 
kTTTCATAAG 
rCCAoAAAAG 
^AGGGGG TTA 
TACGTTCATG 
^GATGACCTC 



^ AAA ^^ TC ^^rcGTTTAAATAAT™^I A ^ 

sacx:c^aacattcactttaactattgtctc™?c^ 

^TGGAGTATATTACATACTCAGGG1\3CTCTTGTGCAAG^C 
_ ^CAGTAGTGTTTATCATGACAGGACCAAGGCTCTGT^SS 
2T G ^ C / ACAAGGCT ^ 

TACAGATACGATGTGGATACTCAGATCTGGTGG^T^f ACCTC 



AAGAGGAGCC^AACATTGACTTTAACTATTGTCTCTTATC • 'A&&&r 



1 

I 
t 

TACAGTAATAGAGTTAAGGTACTAATGG^ 



"•TTA' 



TATGG TTTG ACTTTTGTTCAGGATTGTATGTAACAAA* 



GTTAGGAGATCAA 
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TTAGTGCTTTAGACCTGATTCACTGAACTCTAGCAAGGTTTGCTCT<CTTC 
AGAATTCTCAGCAATAAAAtSCTGTGCTGATTTTATCCATACTTAAAAAGC 
ATATCCTTCCTTTTCTCTTTTTGGTGTTGGGGATCAAACCTTGTACATGA 
ATAGGCTATACCATCTTTATCCATTTACATCACCAAACAGGATGCTCTCG 
TGCCTATTTGATAGGGTTTTCACTCACTTCGAACTGAAACTTGGGTTGTA 
AGAGTATGGTACTTTTAGCAAAtGGAAATAAATTTGAGTTATGATGCAAT 
TATAAAGCACTGGTCTCTCTGTATTTGCCTCCTCCTTCTACTCCCTCCCT 
CTTCCTTTCTGACCCCCTCTCTCAACATACATTAGAGACCATGCTTTGAC 
TGTCAATTTATGCTGTGCTGAAGATCAGGTCTTTAGTGGCtGTGAACCAC 
GGAGCCTATGCAGTGGAAGTTCTGGTCTCTGGCTTTTGGCTTACTAATAA 

aacactgagcataaattttgatttgtatttcacaatt<:ttacctggaatt 

CTTAAGTGGAATTATGGAGCCATAGAGAATGAACATTTTAGGGCTTTTAA 

TATAGTTTCCCGAAATTTTAACAGATTTTCATGATTGTTAAAGGAAGTGG 

CTTACGTATAGGGGGAAATCAAGTATTGCACATTTGAATCTAAAGTTATA 

AAGTAATTACATTTAAATTGGCAAATAAGTATTCTTTTAAAACTAACCTT 

ATATTTATTATTTCTAAATAAACTCAAAA<3GAGCATTCTTAAGGACAGCC 

GATTTTTCCGTTACTTGCATACAGCTGTGATAGTGAGTGGAACCATGCTG 

GTGTTTGGAGGGAACACACACAATGACACTTCCATGAGCCACGGTGCCAA 

ATGCTTCTCCTCAGACTTCATGGCTTATGACATTGGT#^GCTTTCCAAAG 

ATGTTTTAGCTTCAGGAATATTTTCTTTGCTGA'PGGAAAGATCACTATGT 

TAAAATAATTGCACCATTTAAAAGAAGTCCAGGTGGTAGAATTTGCATTT 

AATTTG AG T AGGG TTAC AC ATC T ATTG AAAAGC ATT ATTTTGG ATTAAAC 

TACATTAAATTCTTTGTGAAATCACTCTTCTTAATTGCTTTAATTCTT^ 

TTT AGG TTG AG TT AATTGGTATCTTC TTTCTT AT AAG TGCCTT AC ATAGT 

AGTGGTGGTAGTTGTAACCACCAGTGTTATGTTAAGTTTGATGGGATATG 

CTGTTTCCTAGAAACCTGGTTTTACACATGCTGTTGATGTCAATATACAT 

gtggccagaagagg<k:agtgtctgtttattcctggaaaataaacatcagc 

■KXITCrGTTGTGTAAATATCAC^CATGTGATGTTCTTTOrGTTTATTTGT 
CTTPTGCATTTTGAGACAGCCTCACTATGTAGTCTAATl^GClCAAKjCTCA 
GTATATAGATCAAGGTGACCTTGAACTTAGAGAAATCCTCCTGCCTCTTC 
TGAGTGCTAAGATTAAAGATGTGTACTACGAATGAAAAAAAAAAATGTGT 
ACTACCACACCTGACTAGAGATTCATTTTAAAAATTATTCTTATTGTGAT 
AAAATGCTCAGAATAACACTCACCATCTTAATGTTTTAAGTAGTTTAGAT 
TTAAATATATTCCTAGTGTTATTCATGTTATAATACCATCTGCTTGCCGA 

c*itcttgtaaaactgaaactct<x:cx:ttaaacaatag 

CCOICACTCCAGCCTCTTGAAATCATTTTCTATATCTCTATGATTTTGAC 
TAGTCTAAATTAGGCATTTTTTAAAAAAAATATTTTGTTTACTTGTATGT 
GTATGAGTGTTTTGCATGCATGTATGTTAAGCACA<X1ATGTATATTCAGT 
GCCCATAGAAGCCAAAAGTAGGCATAGATTGCCCAGAGCTGGAATTACAG 

acttttgtgagccaccatgtgggt<k:tggatactgt<x:ccaaatcctttg 

GAGGAATAGTGAGTCTT<^AGCT<5TTGAGGGATCnnX3TCAGCCCTAGAT 
GTTTGTTTTTAACAAACGTGTTTTTGCCAGCCAT^ 

GAATCGGGGGTACACTATAGTTA<3TCCTTA<3CTTCAA<3CTTGTGGAAGCA 
GAAATGAGAAGACAATATAATCTTAACTCA<5GAGGATTCTTGCTGGCTGA 
AACAAAGATGTGAAATTACCTCCGAGCACTCCTAWSCCACTGGQGTGAGC 
AGGGTGGTCTGGAGAGGCCTTGAAGAGAAGCTGTCTK3AGCTTGTTCCTGG 
GGACACTGGGAGTCAAATAGACCTCCTGGGCA<3GGGGATTTAGTGCAGAC 
AAGAGGCAGGAAAGTACA^TCAAATATTTAGGACTTTTGAAGGGCTAGC 
TTTCTTTTGTCATGGTAACACAGAAGGTAGCAGGTGACTGTTAGACTAGA 
ATGTTCAGATCTGATTCAGAGTGGCAK3GGATGGTTGGTTGGTCTTGTGTA 
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AAGTCTCACAAGTGATAGAATCATATGTGTGTCTTAGACTTTTTTTGTTG 
TAGGTATTTTAGATTTTTCTTGTTTTTCCTTTTTGTAAGTCTGGCCCTCA 



ATAAGTGATCTGGAAGGGGCATATTGACAGTTAGCAGGCTGCTACTGCAG 
AAG TCC TAATTAGGTTTGT ATC AAGG C C ATGG AAGG AGC AGTG AC TTC T7- 

GTACCTGGCTGTTGTGTGTCTTGACAAAAATATAACTGCCCT-T-TCTTCCC 
AAGTGTCTACTATGGACCACCTTTGCCAAAACTAAAAGCAGATTCAGAG^ 
AAAACATATCATGATTGCACATGGCTATAATCCCTGAACTTAGGAGGATG 
AGAAATATGGCAAGATTGAGACCAGTCTGAACTATCTAGTAAGACCGTGT 
CTTTAATAAAAATAGTAAAAATTATAAAATCAG<3GAGTAGGATCTGGGA^ 
GAAGAGAATGAAGTAAGTGTGGGGCATATCCAATTGGAGATGTCTTTAGG 
ACAGAGCTGATTGCTGAGAGGTGGTTGTAGGAGAGGTGAGTTATTGTGGG 
GCATAAAAGATGAGCAAGAGTCAGAGACAGTTGGAGAACAGAGTCTGAAC 
AAGAGTAGAGACTAAAGAGAGTGTCAGAGAAGCAGGGAGAAAATAGGTGA 
GATTGATGACCTGTGAGATATGTTAATG<3CCAGAAGAGTGGCTAAAAATG 
ACTGGAGAATCCTTCAGACTTGTCAACAAAGAAATCCTTTAGCCTAATTT 
AGGGTGCAGGCGGCTGAGGAAGGACATAGGTGAAATATGTGCTCTGTGTG 
TTCATTTTTATTAAAGCTTATCTGCAAAGGCCTCAGATT^TGCTGTGTACT 
TGTAGCTGAGGCTCTTTTGAACTCCTGGTTCTCCTGGCTCCACCTTCCC?- 
AGTGCTAGGATTACAGATGTGTGCCCTAGTTAAAATAGCTGTATACCTAG 
CATTAAAAATTTTAAGTTAGAAAATACTGTGGTGCTCCGGGGATGCATCT 
CAGCAGTAGAGTGCTTGCCTGCTATACACAAGGCCCTGGGACTGATCCCT 
AGCACCACAAATACTAAAGCAGACATTCTGGTAGGGAAAACTGGTAGACA 
GCAGAGTGGTGACCATCAGGAGGGGGGTTGTGGGTGATGAATGACTACAT 
TAATTAGAAGTTCTGTGCAGTATATTTATTTCATGCCCTGAAACATTGCT 
GCTGCTGTTGCTGCTTTCCTTTACACATAATAACATAACTAAAAGACAGA 
CAAGCATGTGGTATGAGGCTGTGGATGAGGCATTCTTTGTTT^ 

TTTTTTTTTTTGAGACAGGGTTTCTCTGACCTGGCTGTCTGAAACTCAGT 
AGGTAGAACAGGCTGGCCTTGAATGCACAGAGACCCTCCTGCTTCTGCCT 
TCTGAGTGCTGGGTTCAAAATTTATGTTTTTTTCTATAAAGACTGAGAGT 
TCACATGGACTATATATGACAACCTACTCTGAAATGTGTTTTTCTCCCCC 
TTAGCTTGTGACCGATGGTCAGTGCTTCCCAGACCTGAGCTCCATCATGA 
TGTCAACAGATTTGGCCATTCAGCAGTCTTGTACAACAGGTAATTGGAAA 
GCAAAGGCTCTATTACTGTCTTACATCTTATATTCATTTTTAATATCAAC 
TTCCTAACAGTTGTATCTGAATGGTAAGAGGTTTGGGGAGAAAAAAGGAG 
AGAAGGCAGTTCTAAGTGCACGATAAGGTAAGGGGAATAGGACTGGGAGG 
TTATGGGGTCAAAGAGCJ^GTCTGAAGTCTGCACTATATCCAGGTGTGTG 
CTCAGCAATACTTTTCTGACCAGCAGAGOTCTTTTTCCATTTGCTCCAGG 
AACCTTAGTCCTGTAAAGGACATGCAAAGGACTAGGGTTGTGGGCCAGCA 
ATAGAGTGTTTATCTAGCTTGCACAAGATCCTGAGTTCTGACCTCAGCAT 
TTTGCCTTCTGCAAACACAGCATTTGCCATAAGGGACATGCAGAATGGCC 
ATTTTACCTAGTCACTTGAAAGTGTGCTTTAAGATTGAGAAACTTAACAG 
CCTGCTGATGCTGACTTTTCTTATTTTGCTTCTGTTACTGCTTTCTGCTT 
CTTTCTTTAATACTCTAATGCTTACATTATATAGTCCTACAGGTATTCAA 
ATTTTCTGTTGGAGTTTCCTAATACAAGTAATTTAACTTGCATTAGGAAA 
AGGATAAAAGTGCCATTCTGGAGTTGTGAAGAATGACCGTTTAGAAGCTA 
GATAGTGGGGAAAGATGATATCTTTAATCATGTGATTATTTAGTGTTTTA 
CAAGTATATAGGGGATTGTGGCAAGACCATTGTATGATTA<3AGACTAAAG 
TG<5AAAGATTTTTTAAATATCTTGTTAACTTGAGTGTTATCTTAAATTAC 
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AATCTGATGCTTTCCTTCAGAAAAA<KXCTAAATGCCTCTTGA<^TTTTC 
ATCTGGCAAGTATCATGTCACCTGGCCTTGCTGGTGGAATCTGCCCCftGC 
TCATGTGTGTTCTTA<5TGTTCTCCTAGCACAGAGTTAGGCACGTGTGGGC 
ATTTGCATACTAATGTATAGTAATAGTAACAATTGAATGAATTGTCTATT 
AAAACATTCTTAAGTTTTACCCAAACACAGAGAGGTCK3ACAATTTGTCAT 
AAAATGTAGTTTATCCATGAATCAAAATCAGGAATGACTGTCTGAACAGT 
<3TTTTTATTTTTTATTTTATTTTATTTTGTGTAATTTCTGTGATGTGTTT 

G AAT ATC TC AG TTTTAGGC AGG ATTGG AAATG TTAG AGGTTQGTAAG AGG 

TCATGGTTGCAGTTTGATCATGAGAGAAATGGATGGCTCTCCCTTCATTG 

CAGTGTTGTCAGTCAGCAGTGTGGGATCACCTATGTCTAACAGTTGTTCT 

AATTGAGAGAGGATTACAGGAGGGAAAGCAGTGAGATTGTGAGGTGCTAG 

ATGAGGAGATGGCATTTACCTAGCA<3GCTTCTCTCCCGCCCTCCCATCAT 

GTGACCTGAGAGATTCACAATTTCTGAAGATATCAGCTGTGCTTAGTTTA 

AGCAATAGTTTTATTAACTAAATCCAACTTGATTCATGTTATTCCCAGGG 

^ACCAGTGGTAGGATTAAAAATGAATCC^AGl<3TTCTTTTTG<3TTATTGG 

AATGTCAAGTTTTCAGACACTGTAACGAATACAGAGCCATACAATCAGTA 

TATTTATTTGGTCCTTTGTTGACTTAGAAAAATTGAAGCCCAGTTTAGGT 

GAGCTACCAAATTTCTCATTCTGGATTAGTATTAAACT3X3GGTGGAGTTG 

TGGGATCTTGGAAGTGGGGGCTAAGCATCCGTGTTTGTCACAGCCCAGAA 

GGAACAGATGAGGTTCCTTTTGAGGAGTCTTATGTCTTTATGAACTTGGA 

CTTAGAAATATTTGATGTGTTTAATTCTGCTGTAGTTTTTTAAACTCTAG 

CTAGTGAGCATCTTTTCACAGGAGCGCTTGAGTCTGACCTACAGCCATTG 

TCTGTCTCTGGTGTGCATATTACAAATGCACTGGGAGCGTTTCTTGACCC 

AAACATATAATTAGATTTTTCTTCTAAAAAGGTCTAGTTTGGGAAGGAAT 

GAAAGGGATTAGAGAAATGTTGTGGGTTTGGTATTTATTTATTTATTTAT 

TTATTTATTTAATGTATATGAATGATCTATCTTCATGTATACCTGCATGC 

CAAAAGAGGACATCAGACTCATGATGGTGATGAACCATCATGTGGTTQCT 

GGGAATTGAACTCAAGAeCTCTGGAAAAACAGCTGGTCATCTTAACTGCT 

GAGGCATCTCTCCAGCCCAATTGTTCTGTTTTAGTTTGA<3GATGAACATC 

TAATTTAGAGATGGGCTGCTTTTCCAAAAGTGAGTTTTAAACACTAATTT 

CCATTGTCAGTGGATTGGTCTTTTAAGAATATAGGTAGTGGTGGCACACG 

GCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTCG 

agaccagcctggtttacagagtgagttcca<»acagccag<^^ 

AG AAACGCTG TCTGG AAAAGCAAACAAACAAAAACAAAACAAACAAACAA 
AAACAAA<aAAAAGAATATAGGTra3AATA<3GT^ 

AGTGCATACCTTTAATCGCAGCACTTGAGAA<3CAGAGGCA<3GTGGAACTC 

TGAGTTTGA<X^CAGCCTAGTTAGTCTACAGAGTATTTTeCTGGAGAGCC 

AA<3GCTATATATAGAAACCCTATCTTGAAAGGCCAAAAAA<3GAQGAAAAA 

AAAAAAAAAGAAAAGAAAAAAGAAAAAAAGAATGCAGGTTGGGCAGTCAG 

GGTAAGTGTCTAAGGTAAGAGGAATTCTTCAAGGTGGAAAGTCATGAGTT 

CTGCGCCAGCCTAGGCTA<^GAGTA<^TGAAAGGGGAAGAGACTGTCCATG 

TX3TCAGACCCTCATTTCTCCAAAA<5TCACATGA^TATATTTTTTCTGTAT 

cj»(^cCACTCTTGCATACATGCACCTAACAATAAATATTGAA<3TTCACTCT 

GO^^ACTATAlXrrATCTCATAGACTTCTAGAAAAGTGATTTAAAGTTCA 

AAAGGTAAATA<^TAGTTTTGTTTCAAGTTGGCAAAATCGCTTTAGTAGA 

ctcctacaatcttacatg<k:cagtagcagtatagaagcttgcttgt , pggc 

TTGAAGGCTCACCAATTCAAATATTAGGTAACATTTGTTACATTTTTCTT 

TGTCAGCTCGATAK^TAATGAATGACACAACAATGTGT^ 

TGCATTACTAATTX3AAGTCCTATCACGCACAGCAGACTGAAGAGTTGCTT 

TAATATTTTAt^ACTTTGACAAA<KrTA<^ATTCATA<5CTTCCATACAGA 
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GAGGAATTTCACAAATAGCAAAGTTGGGCTGTTAGAAGAATAAAAAGAGA 
ATTCTGAGTACAGCTTCTCAAAGAAGAGTCCCACGTAGGTGTCCTCTGGG 
ATGTGCCTAGATGCAG<X5TTATTGTACAG<5AGCTCTTCTGTCTGCTCTCT 
GATACTTGAGATTATAGGGTTGCAGGGAAATGCATTAGATGGCATTACAA 
ACTGATAAGATAAAGTTAGGAGCTATCAGAGATTTAGGACATGGTTTTTC 
TCTGTAAATGGGGCTTCTGGTGAGATTCCTAGAAAATGCTGTTTATAGCT 
AGGAATGGGGTTATAGCTAGGAATGGGGAAAGACCTTAAGCAGTTGTGAG 
CTGTGGTGGAATGCATGTGTTTTCAGTTTGCTAAGGCTTCCGGGAATACT 
TTTCCTGTCGATAATTTTCTTTC ACTCTCTTTGTAGCCTTCTTTG T ATTA 
AAATCCTCTCTGCTTGCTTTTGTGTGTGAATGTGTGTATGTGTGTGTTTG 
TGTATGTGTGTGTATGCATGTGCATGTAGGTCCCTACATAGGACAGAACA 
TATTTCCTGGAGTTATAGGTGCTTGTGAGCAGCCTTTTAGGGAACCAAAC 
TCTGTCCTCTGGAAGAGTAGCCCCTTTAACTGCTGAGTCATTTCAGCCTC 
AAGAATCTTCTCTTTTCCCTATTAGTAGAAGATGTCATCTTAGCTCTAGG 
AACTACACCACCTCTGGCCTCAGTGGACACCCATTTACATATGCACATAC 
AGCAGACAGACATATAACTAAAGATAAAATAAATCTTTTTAAAATGTCAT 
TTCCCTGTGTACTAATTTTCCATGTACACACTCACAGGTAGATTTTTAAA 
CTATTCTGAGTGATCACAAAGCAGAGCAGAAGGTGAAATTTGAGAGAATA 
GATGATATTAGTGGATTTTGAGACCTTGAAAATAATGTCTCAGAGCATTA 
AATTAATC ACTC ATGTATG TATG T ATG T AT ATAAG TATG T ATG C ATGT AT 

TATGTGGATGGGGGTGCTGTAGCACATGTGTGGAAGTCAGAGGACAACTT 
TGTGAAGTCATGTTTCTCCTTCCATCTTTATATGGTTCCAGTGATTGAGC 
TCAGATTGTCTACCTGTGTAGCAAGTGCCTTACCTGCTGACCTGTCGCAC 
TAGCCCTCTCAGAGGACTTTTAATATTTGGAATATTTCTAACGATTGACA 
GTCAAAAGTTTATTGTGAGCCAGGCACTTAAAATCCTAGCACTTGTGAGA 
CACAAGATGGAGGTCAGTCCAGTCTACTGAGTTCTAGACCAGCAAGGGCT 
ACACAGTGAAACCTGTCTCAAAAATTTCAAAAGCGGAGCTAGAGAAATTA 
CCCAAGGAGCTAAAGGGAACTGCAACCCTATAGGTGGAACAACAATATGA 
ACTAACCAGTACCTGGGAGCTCTTGTCTTTAGCTGCATATGTATCAAAAG 
ATGGCCTAGTCGGCCATCACTGCAAAGAGAGGCCCATTGGACTTGCAAAC 
TTTATATGCCCCAGTACAGGGGAACGCCAGGGCCAAAAAGGGGGAGTGGG 
TGGGTAG<3G<3ATTGGGG<X^T<KX5TATGG<K3AACCTTTGGGATAGCATTG 
AAAATGTAAACGAGGAAAATACCTAATAATAAAAAAAAGAAATGATATCA 
GAAAAAAATAAAAAAATAAAAAATAAAATAAAATAAAATTTCAAAAGCAA 
CAACTCAAACCAGCCX^ACXsTCGT^CCTCTGAGTTCTCAGTAAATTCCTT 
CTCTCTCTCCTCTCAGCACCATGTATGTGTT<X3GCGGCTTCAACAGCCTC 
CTCCTCAGTCACGTCTTGGTCTTTACCTCG<5AGCAGTGCGATGCACACCG 
CAGTGAAGCTGCTTGTGTGGCAG<1AGGACCTGGTATCCGGTGTCTGTGGG 
ACACACAGTCGTCTCGATGTACCTCCTGGGAGTTGGCAACTGAAGAACAA 
GCAGAAAAGTTAAAATCAGAGTGTTTTTCTAAAAGAAGTATGTTTTTTCT 
eTACTTAGAATTTAAAAATCTAATTTTATCTGAATTGTGAAGGAACCTAG 
TCTCTGTACTTTCCTGTTCACCTTACTCTCTAGTTATTTCTTAATAAAAA 
AATACACAAGATCTTTGGATGGGAGGAAGCATGTGGCTCCTGGAAGCTGT 
TAGCAGGTAATAAGTTGT<^TTGAATTACACAGGCTTTGTGTACCAACTC 
CTGGTCTGGCTGCAGGTGATCTGAAGCCATAGCACAATGAAATTTGTTTT 
CATTTTGGTTTTATGAGACAGGGTCTTGCTCTATAGCTCATACTGGTCAA 
GCTCCTTCTCAGCCTCX:TCCTTCA<3CCTCTT<3AATGCTGGGGTTATAGGC 
ATGCATCACTGGCCCTACTTGGGAAATATTTTGATGACAGACATGCTATA 
TATTTCTTTGTTCAGTTTAGTAGCCACTAGCAATCTGTTATTATTAGATA 
TTTGAAATGTGGCTATGTAACTAAGGGGCTAACTGTTTTCTTTTCTTTAG 
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TGTATGTAGTGAGGCAGATCTAGTAGCACACGCCTGCAATCCAGACACTC 
ACGAGGCTGAGGCAAGAGGCAGTTCTAGGCCAGOCTGGGCTGTGTAATGA 
GACCTTGTCTCAAGAGCCAAAACATCAACAATAAAAGAACAGTATGTGGC 
TATTGGCTGTTATGTTGATGATGAAGGTCTAGTGTTAAGGATAAGAGCCT 
CTAATGGTATGATCACATATAGCAAATTGCTCTGGTAGACAGCAGAGAGC 
TGCTGTTCTTGAAAAGTATTTCCAGCCCCCTTTAGCTGTATATAGCAAGC 
AGTACAGCATAACAGACAAACTATGGTCCCTTCTTCTAGAGCCCCTGGCG 
TGCTCTTGTTATTTTTCTCTCCTTTGCTACTTGCTTAGTGGTTGCTCTGA 
GCACCACTTCACCAACTCAGCGAAGTAACGTGCAAAAATGTTTGGAAAAT 
AAGAATGCCTCCAAGATATTTGTCCATATCAATCTTTAAAGTATGAAACT 

acttccttatctagttgttgcagttacatgagagttatattaggcagaga 
ctacttctgtttttctggtatgtgttaaataaa<3ttgtgcagggacataa 
agctcctgaggctgtgctgttgattagaattttggttcatttatggaaaa 
cagcttaccagaacctggtaggattcataattctcccgaaacagttagaa 
ttgg t ag aat aacc aaaattt aaagtt aagc tt aaat at ac ag tg c attg 
gaaataatattatcttctgaggttcagtatga<k:ccattagtttacctca 
ctttctgggtagacctaatcctgtcagagtaaacttggcaagaaaagcag 

CCTACATGAAAACTGATCAGGCAGGGAAGTTTCTGTGGCCTCTCTTCCTG 

CTTGTGTATGTCATATTCATGAAATGATTTATAGATGGCAACATGGCTTT 

TAGCTTCTTGTTTGGGGATTTAATGAGAATTATGTTAGGTCTACAAAGAG 

TGGAAGTTGTGAAATCCACAGGTTTGGAGTCACATGAGTATATAGAGTTC 

GAGTTAGCAAGTGCCTCCTGTGGGGTTGTGGGTCACTGGGTATACCTGCA 

CCCAGGTAGGCCTTGCATTTGTAACAAGGACAAATGTATTGGTCTCTCAT 

ATTGCTTTCTTAGGCTTCTGCACAGCTTCTGGTGTTAATTCTGTTGCTAG 

TTGATGTTTGTCGTGGGAAGAAAAGCATCCATTACTTCTTAGAAGCTATA 

AAATTAACAGACCTTTGCTTTTCACTTTCTGGACACTATGGGAGGACAGT 

TATAAAACAGTGTTTCTCXKSATTGTCTGCTTATATCTGTTTTATTTTAAC 

CTAAACATGGCACTGCTTTTTTCCTTTCAGTTTGACTATACACTTTGCTT 

CCTGACTATTGTTAGGAGCTTTGCTACCTCAGATTATACATAAGAGAGGC 

TGCCGCATAGTTGATGGGTTTGTCTTCTCTCTGTAK3CCCTTGACCATGAC 

AGATGTGACCAGCA<^CAGATTGTTACAGCTGCACAGCCAATAGCAATGA 

ctgccactggtgcaatgatcactgtgtccctgtgaaccacagctgcacag 
aaggccaggtcagatgctgtttttcacggattttagggaatagaaaaatg 
ctagatgagtgtcagtgtagggcaaataatgagtagagttctttttaaaa 

tgggatatcgatttgaattctactgt1kx:tca^ 
gatgctatata<^tccigattccaaggatcg<:tcx:tgctgx^a<X3tc^ 

•IGTK3CAGTGTTTCCGAAAGCATGTTTTACAGAATQCCCTTGQCGCATATC 
TGACTCAGCATGACATCTGGGCTAATCATGTATGATTTGTTATAGGTGAT 
AATAGGCTATGAGTAAG<3TGATCCAGCTTTTGCTGTCTT1H3ATGGCTTAT 
GACATTTTTTTCTCAAAGTTTAATGCATTTC^ 

ATTGCTAT^TGGGCA<X3GGC1XKMAGGA<3CTCTG<3AAAAGCAGCAGGTT 

cagctttcacgtttta<^gataag<attggctga^ 
tggttccgttgggctgctagcttgccagctaaaagcatgttagtgagaat 

ACACACTGTGGTATTC^<^TTX3CAGTGCTCCTTC^ 

TATCATTCATCCATCTACCTATCTCTATCTATCTATCTATCTATCTATCT 
ATCTACCTACCTACCTACCTAGCTACCTACCTACCTAGCTACCTAGCACT 
TATCTAATTCTATCTCTCTCTCTCTCTGTCTTTCTGTCTATCTATCCTGC 
ATCTAATTCTATCTATCTGTCCACCTATCTATCATCTAATTTTATACATC 
CATCCATCTATCCATCTATCTGTCTGTCTATCATATATGTAATTCTAACC 

attcatctatctatccacttatctgtctgtcatctaattccatcx:attta 



BNSDOCID: <WO 0005373A2_I_> 



WO 00/05373 

29 / PCT/US99/16484 

TGTATCTATCTATATATCTAATTCTATCTATTCAATTCTTTTCTTTTTTT 
^^^ C ^ TCCAGTTA CCATTCTCAGTTA^^ 



GAA 



GGG^CGATATTTTGTATGTTTTTAACTTAAATC 

GTATTTACTAGCCTTTGAAAGAAAGTGAAGTGTCAGCTCATGTTCTGGA^T 
AATTGGGGGGTAGCTTAGATCGATGTTACAAACTGTGTCCCAC^GTC^^T 

™^T^ TOreAAGGAG ^^ 

^TCAGGAACCTGCAGGAAGCACTTACTGACAGTTGTGTCAGAAGAGA? 
^TGTACCAGCATCATCTCCCATGTGACCTTCCTTCCCGACTAT^G 

Cl^TCTCTATGATGTATTTCCTTGATCAGGATATCCAGG^ 
^TCTGCTCTCCAACATCGAGGTTTGAGGGGAAG^TCTCA^^ 

tcaaagcattttatttagtttgctgaat(^tttag™ 

CTATTGCTGTGAAGAGATACCATTTCCAACGTGTAACTTT^ATGAAAGGA 

aacatttaagtc5g<^cttgcagtctcagaagctatta^^^S A 



^ A SS ATAGAGGCACAAAGGCAGGCATT AGAGTX3GTAGCTGAGAGCTACA 

tcctcatctgtgagcagaggcagacaaggtgtgaaaaa^acagaac?^ 



CCAACAGCTCCTGCAACAAAGCTCCATCCCCCGATCCTTCTCCAGTCCTG 

ccactccctggtgaatgagcactcacatatatgagcctatg^?S?? 

CTTACTCAAGCCACTACAGGCTTTGTTTTGTGTCTCAGACTTT^ 

CTGACTCCCTGCC^GTAGCACATTCTGAGGAGCAAGTCCCTTAACTCGCT 
TACCTGCTCTTAC^TTACGCCTTTCCCTGACCA^AGTCAG^^?^ 
GTrcTCCCCAACCTGAACCTGGTTCTGGGGAAACACTTGCTTA^TCA^TT^ 

^^ T ^^ c ^^ agcaagga ^^ a ^^a?^aS?^ 

A^GGTTTTCCi^CC^GTCTCCCTTTCACCTCATT^^ 

^TACTTGCTTATACTTTCTATCTTATGACATGAAAAC^ 

TTGGA^CTTAAATTTATCA<^TTCCC^TTCAAT^ 

J^S^ TCTATATACATCAGTCTCCAGATAA ATATCTC^ 
A^AGGCCAGAGGTTAACCTCTTC^^ 

^^^^^^ '^ GG '^ UrG ATG AATG AG ^^^^*^ ATTAC ATAG ACTAGCTGA 
AAGCAAGCAAAACTACGCCTGGCCTTTTATGTGGGTX3CTTGGAATTTGAA 

c^tacttatgcttgacacaagtattt™^ 

AGCCTCC^TTTGCAGTTTTTTACCTCACCCTTCCAA^ 

i g ~ a t g ^ c, ^ t '^ ga ^ 

AATAATTGCACCAATTTCTTAATAATG^ACCO^ 

agg ^tttttattctggccctc^^ 

^tcttcctctgcaggacccgaggtccccacxxsgSag^ac^^ 

^rr G ^ ACTAAcccAcAc ^ AAGGcAcAcATA cACAci^^ 

ATAAATTATTTTATTTTCAAAGGTTTTT^ 
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TTTGTCTGTTATGCTCACCAGATCCTAACAAAOCACCTCAAATTCAAATC 
AGGATGAGTTCAGATGTTCAGTATTTTGAACTAGTAAAGCGAACTGCATA 
ATTCCTAAAACTTTGTTTTCTTTCCTCTTCGCCTTTAAAAAAGAAAATAT 
CTGTGGCAATGGCTGGCATTTGGTTGGAAACTCGTGTCTGAAAATCACTA 
CTGCTAAGGAGAATTATGACAATGCTAAATTGTCCTGTAGGAACCACAAT 
GCCTTTTTGGCTTCCCTCACATCCCAGAAGAAGGTGGAGTTTGTCCTTAA 
GCAGCTTCGATTAATGCAATCATCTCAAAGTATGGTGAGTTAATGTGTTC 
AGAACTTTGGTTTCTAGGGCACAACAGCAGCTCTTATGTAGAAGGCCACA 
GTTGTATGTTATTTGCCTGGTAAGAGAAAGAATTACAATAAATGATTAAT 
AATATACTGTGGGCCTCTATTTCAGAGGCTCTTCTTTTGATACCTTTCTT 
CTTGTCTTAAAAAGTTCAGTACTTTGCATATTTTATTAGTTCTTATTATT 
AAGTAAATTATAAGGTATGAACATATGGAATGAATGGTAATATGTGTACA 
T ATTC TGG TG AC ATC AG ATT ATTTTG T ACTTG ATTTATATCTAG ATTCTG 

CTTGGGAAAAGGGAGAGTAAAATGTTAGTTACCTAGGTGTCATTAAAGCC 

ATCTACAGCCCCTGGAGGTATTATTATAGCACATAGTCTAATCGTCAGTA 

AGAAATGTAAAATCTGCCCAGGTTTTATAGCCTTCTTGCTAAGGCTTCTG 

AACTCAGAAAGTTGTCTTACTCTAGAGCCAAACTCTCAAATGGCTTGTAG 

TTACTATATAGTCTCATTTGGTATTTTTCTTGGTAAGTCTAATTCTAAGA 

CTTGTGATTTGACTGTGATGCTTCAGTCAATTAGATATTC^WCAGAGCA 

TTTTCTGTCTATGCTGGCTGTGGTACAGAGAGATGTGAGGGACATGTTTT 

tgtctagccag<3agaagacagaatgcagctcagcatgtctcatttggcac 
caccttcatgtgatgggatgccggtatggtgtgggtcctggttgttaaat 
ctcaggaagtccatatatccagaaatgacctcaactataggtggatttct 
ggcaattaggtaaaagtcagcattccttgggcacttgggaaactggttac 
catctgcataaaggagtcatttcc<:ttctatctggcagaagggacatatg 
gctatctattgtgcctgtcagcatggaagcacatgctagtctccaggicc 
ccccaatatcacaagtacctatagcagtgaattagttaaactgatttggc 

TCCCAATGGGTCAAGTACAGCTGCACCTGCCCAAGAGCTCTTTGGGTT1X3 

caaatgagagacacatagttaatttttatatgctttcactagttcagttg 

ctggacatttctaatcctccctgcagtagcatacattaacgcx:tccaact 

ttcctgagtcaacttactaactcaacatt^k^tctctga<laccc 

taatggcagagtggcccttagagccactttccc^ 

atattttctttattttcatttccaatgtc<x:ctttgctagttt^ 

ctctccccctgctccccaaccca<xxiactc^^ 

tcxtcctatactggggcatagagccttcac^ggatc 

cattcatgaccgactaggcc^tcctctgctgaatata<^<3ctagcaccac 
gagtcccaccatgtgttttctttgattggtggttta 

tggggtactggttagttcatattggtgttcactttcccaaattcttacat 

ggctggtttagttctttcodscagctcttaggtctaatc 

tctgtcatggtgattgcc^rcctct<:ctatctc^ 

atctaaaagtcccacctccatctttctgcgcagccactggctgtat'gcag 
ttctttattatcagttgaagccagctaggggcagagaccttcaggtctgt 
■^gtgctttggggagcagaattaagacaaagcattagaagcaattcccaa 
caagtacctgctatacatttcaaagtccatattagtctcctsggtcttcc 
cttccccagctacttgtcc j ixxtttgtaatccaaatga<1aagctt^ 

acatctctttatctcacatttxx^ctaggcctggccatctccact^xattct 

tttactctctxsctclgctctctttrccaatggctctggatattti 

cttattcacaataaaaaccaaaccaaaccaaacaaaaaagcttacgctaa 

taatggagtggtcacgcctgaggtttgcttactgctggegcttgcacagg 

tcttgtgtctgacacactggcaggcttttattagcagcaggctctaggag 
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gaaaactagcatgcactctga^^ 

G<^AGAAGGGCATGCCTCAAG^ 

ACA TAGAAGATTATGATCTCA^^^ 

GGAACKK^CTGGriX^TTCATAGC?^^ 

GGIKX^I^AGGATATCTCT^^ 
CCATCTGAGCCCAGTGATGCTGGCTTC^TGTGGG^ATY^ 

TACTCGGGGATTAAAGGCTrr ^ ^^ GAGCCTA G 
GCCAAGACACX^GGAGG^CA^C^^^ 

acttatgg^gcagggag^agSS^^^^^^tgatgaagt 
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CTTTGCCTGAGTAATACAGCCTAGGAGCTACTTTCTGATAGGGTTTTTT.£ 

AATACTTAC AAAG AATTATTTATCTTTAATC ATGTGGTTTTGTATG TG TG 

TGCTTGCACATGCAGTGCTTGTGAGAGAGAGTATGTGTGAGAGCATGCAT 

GTATGAGAGTGTGAGAATATATGTGAGAGAGTGTGAGTGCATGTGTGCGT 

GTGTGCATCTGTGTGTACAGGTGTGTGTACATGCATGTGTGTATAAGAGT 

ATGTGAGAGTGTGGGTGTGTGTGTGAGAGTATGTGAGAATATATGTATGA 

GTGTGTGTGAGTATGAGTGTATGTGC<3TGCCTGCATGTGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGAAGTGGCCTTGGAAAAC^ 

GAGTTGTCAGATCTCTTAGAGATATAGTTGCAGTTGGTTGTGAGCCATCT 

CATATGAGCGCTGGAAGTTGAAATTGGGTTCTCTGGAATCCTCTGGGTTC 

CTTGTTGAAGCCTGAATATTTTGATAAATATTTATGTCATTATCCCTCAA 

AATTGTAAATGTAGAATTTAACAAACTCAGGTCTTGAGTCATCTTTGTCC 

CAAGGTTTGTTTGTTTGGTTTTTTGTTCCCCCACCTTTTCTTCAGTGCTT 

TTAAAAAAGAGAGTCCATTTTTTCCTAAATGTTTAAATACAGTTGAGGAA 

TAGAACATCTGACTCCAATTTCCTGGGTTTCCCTCCATGTAGTGTAGTGC 

TGACCTGATTTCAGTGTGCATTGAAAACTTTGATCACTTGGAAGGCAGCT 

ATGCTCACCACTATACTACCAATGTCTGCAATCCTATAGGAGAAACAACA 

ATATGAACTAACTAGTACCCCCCAGAGCTGTGTCTCTAGTTXSCATATGTA 

GCAGAGGATGGCCTAGTCAGCCATCATTGGGAGGAGAG<3CCCTTGGTATT 

GCGAAGATCATATGCCCCAGTACAGGGGAATGCCAGGACCAGGAAGCAAG 

AGTGGGTGGGTTGGGGAGCAGTGCGGGGGGGGGGGGTATAGGGGGTTTTG 

GGGATAGCATTTGAAATGTAAATGAAGAAAATAACTAATAAAAATTGCCT 

taaaaaaaaacaaaaaagaaaagtttttgatcttaisctgaccagtgtctc 

TTTGGGTCTTAATTTCCAGCAAACCACAGTGCCAAGCAGTCCCGGACACC 

atgtgccctgcggacagcgtgtggcgagtgcactagcagcagctcggagt 

GCATGTGGTGCAGTAACATGAAGCAGTGTGTGGACTCCAATGCCTACGTG 

GCCTCCTTCCCTTTTGGCCAGTGTATGGAATGGTATACX3ATGAGCAGCTG 

CCCACGTAAGTGGAAGGAGGTTTTGAACATTTGCAGGCAAGTTGGGCTTG 

ACTTTCTGCTCAA<3TCCATGCA<3AAGC1^TCG<^03GCC^ 

TAACATGTATGTATAGAATGCAGCACAGTGTTCCATGCAGTAAATCAGTT 

ACATCAAGGAGAAGGCACAGGGTACAGAAATACCTTTTCTTCTTCAGGGT 

AATATTATAATTCAATCTGTATAATGTTTCTACATCTTAATCTACCAGTA 

TGTAAAGTGCTTTCTAGTAGAGGCCTC(^CA<^TGCCTTTTTCA^X1AAC 

ATCCTGATATTAAAAGGTTGGAAAAGTCCCTGTTATATATTATCTAAAAT 

GTGGGGGCCTTT AAATTATTTCAGTTCAATAATCACTATAX3GGTACTATT 

TTTAATTCATGGAAGTTAAATCATCTGTTAAAAGAAAAGGTAATAACAGT 

AAATTCAAATCyTTGTGATAGTGAATTACAAGTTGGATTGTTTTGCCTTGT 

TTTTTAATA<^TGAAAATTXXTTCTGGCTACTGTACCTGCAGCCAT1X3CTT 

GGAGCAGC(^«XnX3TGGTTGGTGTACTGATCCTAGCAATACTGGGAAAG 

GAAAATGTATTGAGGGCAGCTATAAAGGACCTGTGAAGATGCGGTCACAG 

gcctcokx:a<maaatgtgtatgcacax^<x:ttctcaactccagca«pgtg 

TCTAGAGGACAGCAGATACAACTGGTCTTTCATTCACTGTCCAGGTAAGA 
TGCCTGTGTATCCTAGTTCAAATCTCGTACATAAACTAGACGCGCAGA'TC 
CCTTGGCTCACTTGTTTTCTTGACTGTGTTTGAGTTCTTTCTGTCTTCTG 

catcaccttgttggatcata<3ctggcaaasgtgctctcx:t^ 

TTTTTCTTTACTTGATTGATTGTTTCTTTGGTTGCACAGAAGCTTTTT 
CTTTCTGAAGTCCCATTTGCCAGTTGTCCTTAATTC^ 

GCCTCATAAAAAAAAGTTCCTTCCTACACATGTATCATGTAGGGCACTGC 
CTATGTTTTATTC^<3AAGTTTCAGA<^T3^ 

TTTAGGGTTACTTTT^TCAAAGGTAA^^ACACA<3TTCTCTTTCATTCA 
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^^^^ 

^TTTTTATTA^^GTTTCAGTCTCCT 

ctaatctccttatga™ 



^AAGKSTGTTTGGCT^^'^?^ 
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CAAATTCACAGAGATCCACCTGCCTCTGCCACIXjAAGTTX^TGGGATTAAA 

GGTGTGTGCCACCACTACCTGGCTGATTTTAA<3TTTTA 

C AGG T AGGG T AC AT AG ATTTTC T AC ATTTG TG AAGG TTTGCGTTG T TTG T 

CAGCATGTAATTCTGTGTGCTGCTGAGGGAATGTATGTTC 

T AGG TGG AAAAG TCTGTAG AC ATC TG TTAG ATCC ATTTTAC ATTTC AAG A 

AGCCATTTAATTCTGAAGTTTCTCTGCTTATTTTTTC 

CTATTGGAGAAAATAGGGTGCTAAAATCATTTACTATC 

AAGAAGAAAATAATTAATTTAAAAAACCCTGGAAAGAAAGAT^ 

TG AATCATGTTTCCTGGATA<3TGGGGTTATAT 

TCTCAAATACTGTGAGTTTTTACAATCAATAACAAC^ 

GTTGCTGTGGACTTTAACTTTGCTTTGATAAT^ 

CTAATTTCTTTTTGATATTTTATTTTCTC 

TGGTTTTGTTTTGTTTGTATTTTTTAGAC 

GTCAAGGGTGGCXTTTAAAATCCACACCCAATACTTTGTCCTC 
TCTTTCTTTTTTTTTTTTAT^ 

TATGTTACCCAGGCTGACCTGAAACTTCTGGGCTTAAGCAAGA 

ACATGATCAGAGAC<5CTGCGCTGCCCGCCTCA<XXXTCTGCT7^TTGG7^AC 

TATAGGCACAGACA<5CTGTACTTCACTCATTTCAATGATTTAACATTTAG 

ACTATATGCAAATAAATATGAAATGTATTCACCAAGTTCTCCTATGGGAG 

AAACAGAGCCCTTAAGATTTTTTCCTTTCA^ 

CAGCAAATGCATCAAGCAGAGTATCTGTGAGAAGTGTGAGGACCTGACCA 

C<3GGCAAGCACTGCGAGACCTGCATATCTGGCTTCTATGGTGAJC^ 

AATGGAGGCAAATGTCAGCGTAAGTCACACAGGTCAAGTTA 

CAGGTACAATAGTACAGTACCTGCAGTTGACTTAAATATCTTAAAGGGAA 
AAGKSCCTCTTGGTTTGGGATATTGCCTTTCT^ 

AAAGTTTAACTGAGGGGCTAGAAATGTGGCTCA^TTGGCTAAGAACA 

ACTGTTCTTCTAGAGGAC(XSA<^TTCAATTC^ 

TCACAAGTGCTTGTAACACCTGGGATCCAACAACCTC 

TGCATG<IIAAAACACTAATATA<IATAAAATAAATCCAT^ 

ATGATGCTGGAAGAGGAAAAAAGGCTCAACTTCTGGGT^ 

AGTTAAAGCAACAAACCGACAGTAAAGGAGCTAAGCTTT^ 

CAGA<3GCATAAACAAGGGGCCGAAGTCACTGA<5G^ 

TC<1ATTTCCCTC<TCATGGAAGCACATCAGCTCAA^ 

TGGGATGGGAGGTCATCTCATTGGA<5AAGGAt3QCA<^ 

GGAGGGAGGACAAGGCTGGGAATGGGAAGTC<n>^ 

GAGGACAAGATCTTCAGTTTCCTTCTTA^ 

TCTCTATAGAAGTCTACTGGAAGCCTCACACAGGCAC^ 

AAAAACTGTGACAGCCAGGGAGAGTCCCCTTCTGAAGTGTC 

AGACTGCAGCACCTGACTGTGCCCCAGTCTGC^ 

AACTGACCTCCTGAGGAGCCCAGATGAATCTTTAAGATGGCCI^GCTTTTG 
GTTTTGGTTGGTT^TTTTTA^ 

TGAATTCTCTGTCCTCCTGCCTGACCTCC^ 

CTCCCATTAAGTTGTGAGTTTGGGTGTCTGGCTCCT 

TGCAGTACATTGAGCTCCATA<5J^ATAGCGGCGGGGCAAATGA<3AGCTG^ 
ACGGGCACTGGGTGACTC1X3TGCCTTG 

GGCAAAGGAGATCCTAA<3AA<XXK3At3A<X3CAAAATGTG<?TC 

CTTTGTGAAAAGCTGCTGGGA<MA<^ACAAt5AA<3AAG^ 

CTGTCAACTTCTCAGAGTTCTCCAAGAAGTGCTCAGAGAGGTGGAAGACC 

ATGTCTGCTAAAGAAAAGGGGAAATTTGAAGATATGGCAAAGGCTC 

Gk3CTCGTTATGAAA<5A<5AAATGAJUVACCTACATC^^ 
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TCTCATCTATAAAGCATrT^C^c^r^ GCTCGTOTAG ^Tm 




TCTCTCTATCTATCTATCTA^^^?^ CCmAGC ^CTTGGGTAC 




gag <X3aggacgtc^aagg^^^ 



BWSDOCID <WO 0005373A2J.> 



WO 00/05373 



PCTYUS99/16484 



36 / 89 



ATAAAATGTGTTTATGACTTTGTGGTGACTGCCATTCAAGAATTAGATGC 

CTTAGCCAGCAATGATGGCACACGCCTTTAATCCCAGCACTTGGGAGGCA 

GAGATAGGCAGATTTCTGAGTTCCAGGACA<3CCAGGGCTACACAGAGAAA 

CCCTGTCTCGAAAAAACAAAACAAACAAACAAAAAGATTTCGATGTCTTT 

ATCACCCAAATCAAGTAACTTTCCAAAGTCTCACAGTGAGATGTAGCCTA 

GTTGGGAGCCACATCTAATATATGCTGATGATCTTAACAAGTAGCCTGCT 

TGTGTCTTCAGGTGACCACCCCGGTGTCCTCAGCTACCTCTAGAAAGATC 

ACACTTTCCTCTGTGGTCTCTGCAGGGTCCCTGTATGATTCTGGAACCTT 

GCTGTACTTCTCAGAGTCCTGATTCATAAAGCACTGAGTTTTTGCTTGTT 

TGTTTGTTTTGATACTATTGGTAAGAATATATATTGAACCTTGACA'PGCC 

TTTTTAAAATAACATTATTTTTACAATAGTACTTTAGCCTTGATTATGTT 

AACTGCTTACTGTTTCAGATGACATTCGTACATCTTTTAATCCTCAAACC 

AGTCCTATGAGATGGCTAGCATCATTGTCACATCATTTAGGCAAGGAAAC 

AGGTCTTGGGTTAAGCTTCATGCTCAGAGCTCCTTGGAACACAGTGGACT 

CAAGTGCAAGCAGACTGACGCGACTGGGTTTTACTAATTCAGTAAGCCTG 

TACTCTATGGAGGAAGAGTTTCTGACCACTGGATGCAGTCTGATGACCTC 

TGACTGTTCTGTTTGAAAGGTTTCTTTCAGTGATTTTATTTTTCTCCATG 

TGGACTTTTTTTCCAGCTTTTAAAATATATATATATATCTTATTCK3CTTC 

ACATCCTGCTCACTGTCCTCCCTCCCCTGTCATCCCCTGGTACAATCCTT 

CATATCCCCCCTTACCTTCTGAGCAGCTGGGAGCCCCTCTGGGTATCCCC 

ACACTCGGGCACATCAAGTCTGTGAGGCTGGACGCATCTTCCCCCACTGT 

GGCCAGACAAGGCAGCCCAACTAGAACATATCCCACAGACAGGCAACAGC 

TTTTAGGATAGCCCCTGCTCCAGTTGTTCAGCACCCACATGAAGACCAAG 

CTGCACATCTGCTACATATGTGCAGGGAGGCCTAAGTTCAGCCCATGTAT 

GTTCTTTGGTTTGTGGTTCAGTCTCTGAGAACCCCAAGGATACAAGTTAT 

CTGACTCTCTTAATCTTCCTATAGAGTTCCTATCTCCTCTGGGGCCCACG 

ATTGGTGTCGCTATTGCTTCACTGGGATTCCTCCCTGGCTACACCCACTA 

TGACCAAGGCAAGTCTTAGAAAAGACAACATTTAACTOGGGCTGGCTTAC 

AGGTTCAGAGGTTCAGTTCAGTATCATC^AGGCAGGAACATGGCATCATC 

caagcag<x^tagtatagaaagagctga<3agttctacaacttatctgaag 
gctgctagcagaatacggacttccaggca<x:taggatgggggtcttca<3a 
cccacacccacagttx3gtgtccctattgcttcac , i\3gggttcctk3c<?rog 
ctacaggaxx3tagcxztcttcaggttccatatccccaatgctgtgaggcac 

AGTTAAGGTCACCCACTATTGATTCTAGGGTCTCTCCCTCATGOCAGGTC 
TCTTTCATTGTGGAGATGCCCCCCACTTCCCC^^ 

TTTCCATTCTCGGGACCATCTGGCCATGCCTTC^ 

GATCCCGACACCCCCGOCCATTCCTTCTCCTACCTAGTTCCCTCGCTG<^ 
TATGCTTCCTATGACTATTTTATTCCCCCTTCTAAGTGAGATTCAAGCAT 
CCTCAC^'TGGGCCGGCCTTCTTGTTTTGTTTCTTTGGGAC'PGTQGA'GTGT 
AGCTTGGGTATCCCATTTTTTTAiKXX^AATATCTGCTTATAAGTCAGTA 

cataccattcgtgtccttttg<x5attgagttacctcactcaggatggtat 
tcttaagttctattcatttgc<:tgcaaaattcatgatc 

gtaactn3aata<3tagtccactgtatagatctaccaca<3tttctttatcca 

TTCTTCAGTTGAGTGAAATCTAGGTTGTTTCCAGTTTCTGGCTATTACAA 
ATAAAGCTGCTATGAACATAGTGGAGCATGTGTCCTTGTGGGATGGTAGA 
GCATCTTTTGGGCATA1GCX:CAGGAGTGATGATATAGCTGAGTCTTGAAG 
TAGAACTATTCTTAGTTTTCTAAAAAACCACGAAATTGATTTCCAAAGTA 
GTTG TAC AAATTTGC AC TCCCTCTAACC AAGCAAG TG AAAG ATCTK3TATG 

ACAAGAACTACAAGTGCCTGAAGAAATAAACTGAAGAAGATATCAAAAGA 
TGGAAAGATCTCCCATGATCGTGAATAGGTAGGATTAACAAGGTCAAACT 
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TTAATGTTGCCTCTATATTATTAG^i^rTr^r^^^^^^T 

CCACCTCCT^CTG^ATTG^C?^^S C ^ CCCTCTCG C 
GGACTAGTGTGGAGGT1TACCTTTT 

AGGATAGAATCAATGACAAA^^A^^S^ AGATAGGAAGAA C 
GAAGCATACTCAATCXACACA^G^AAA^ 

TACCTAGCATTATTACA^TCTA^??G^S^ TOOTATCA<5T AG 
GTTTATTTGTTGTTTTTATAC^rcT^^ 



TAG T '^ATGTAAAAGG ATTTACTT<^ 

gct^aaatgtcaatagaag^^^ 



T< ™TAATTGTTTCTAGAGAGTAAT'rr^^ 



GT ^GCTCCCGCGGAGGCGTGCTCAT-^^ 

raTGTccAATcrccAGACAGc^^ 
gttccacgattx^tctctgcagctca 
aacaccaaattaatac^^ 

WATCATAAATATATTTTTOTCT^ 
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CT ATG ATTCT AAAGG TC ATTC ATC AAATAT AATG TTAATG TG TATTTTAT 
TAAAGTTTCAAACTTCTATCTTTAATAATC 

GGAGAGCTCTCGCTGTAAGGTCCTGTGTTCTATCCCCAGCACAACAAAAC 

AAG AC ATTT AAG AAAAAATT AAAAC AAGTTGGC TG T ATTG TC TC AG T ATC 

TCATCCTTGAGATAGTGAGGCAGGA<MACTTTTAGTTTGA<^CCTATGTC 

GGTTATGTAGTGTGAAACCTTTCTCAAATAATATTTA 

AAAAAACAACTTTTTTCTTAATTTATGTGTTT^ 

GCAATGTGAACATATCTGTTGCCTTTGAATGCCAGAG 

TCCTGGATCTGGAGTTACCAAGGGTTGTGA<X:TCCCATA<3 

TAATGAACTGAGTCCTCTGGAAGAGCAGC<^<3TGCTCTT^ 

CATCTCTGCTGCTAGGTACTCCCCCTTCCCCCCTTAAATTTAAGACAAAG 

GTCTCACTGTGTAGCCTCA<3ATGGTCTAGAACTCAATTTGTAGAATGGT^ 

GACCTTTGAACTCACAAAACTCTGC<^I^TTCTG^C 

ATTAAAGTTGTATGTCACCACACCTGCCe<:TATGATTTCTATATTTAATA 

7^GATCATGACTAX^ATATAGAGAACACTTTTAGAACTGAA<5AAGAAGAC 

AGTTACAGTTAAAAGCAAAACAAAAACAAAAACAAAACA 

AAAATAGAATGAAAACTAGCACTGAAGAAAAT^TAAATTTTAAAAATAGG 

CAAAGAGTCACTATTATATTGTGATK^ATGTGTTATATGTTTAAAACCAC 

AAGTGAGATACAGGCCTGAAATGACTTTAATC<3AA£X^ 

<X3TGGTAGTTCAGTTGGTAAAGTTCTTC 

GTTTGATGCCCA<^ACCCATGCTGAAACCCAGGAGTGCTGCTGAGTGCTT 

cagctctggggtx^ca<x^ctcactggcaggaagccta<k5ctaagagaga 

ctctgtctcgaaaaacaaggccgatggcacctgatgaacggcatctcagc 

atgacctttgctk^ggcatataatgtgtacacacaaattcata<3tttagta 

gaagacaagtatgatctgcttttcatgaagtctgttc 

ttagttaaccatagttgcttaaaaaaagaaaaaaatc^ 

agaat^tggatagagtgttcttatagccaattcaattcatcatcattatc 

aaaacctataacttagggggctggaga<3at^ 

tgactcctcttctgaaggtcctgagttcaaatccca<3^ 

gctcacaaccatccataaagagatctgatgccctcttctggagtgtctga 

agacagctacagtgtacttacataaaataaataaataaatctttaaaaaa 

auaacacctataacttaaacttatcaataactttaac 

cttcctagttacc<^ttctgctttctgtttgtatga 

tcttaatggaa<x1acagtgtttgactt^ 

gatgcctctgactck^tccctc^ 

tttggtgctgtacatatagctoagcgtttc^ 

ggtttctgaattk^atccccagcacaa^ 

aggcotattttta<^<x:tggac^ 

tttgcttgtttcttctgtcagtgga 

tgttgtcaggaatattgttlaacatgagtgaatatacacccagaagtacaa 

ctggatgtggtaattctatgagtgttttgt^ 

ttgtttccatacaataaattacatttcct^ 

tcaatttacatttattttctgag^ 

acataaaaaatagctcattgtagttttggtatttgtat^ 

tggtgtgattatctttttatattct^ 

tttggaaaaacacctcttcaagggtttta 

ggaacttgtgcagacca<xx:ttcc^^ 

TTTCCATTGCTGTGAAGA0GCACCATX3ACCAGAGCAACTCTTACG 
CATTTAATT<X5GGCT<X5CTTACAGTTTC 
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ccactcagaaggcaaj^ 

AGGTTTATTTTTATTTTA^TTTATATGTATGA^ 

TCTGATGCCTGCAGAGGTCA^A^GAGGGTGTTGPAw^o^o 
G ™CAGATGCXroTK3AGCA^A^^ 

TTCTCTGTTAG<3GCAACAACTGCTTTTA^CCAT 
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ACTTGTTTCATTTCCTTAGCA<X!TTGGCTA<3ATTCTTA<K3ATTTTAGAA - G 
GAGCTTATAGCAAAATACCACAAGTGAAATTTACTACTGGCTTAGTCATA 
AGCAAATATTGAAGGCTCAGTCTTTAAGGGTATAATTCATAGTGTTCTTT 
TTTTTTTTAAGTAAACAAATA<3CCTGTCATGGTAACTATCGCTGTAGTGC 
CATTACTTGTGAGAGATGTCAGCTCAAGGCCAGCCTGGGCTACATAAGTA 
AGGGAAGACCAGCCTGAGCTATATGGGACTCTATCAAAACAAATAAACAT 
TGTAGAATTTTTGTAATACTTATTA<3AA<3GTA<3CTGA'K5ATCATGAGAGT 
CTTTAGACATTTCTTCATTCCACTGTTTTGTGTGTK3TGTGTTTCATGACA 
GATTTCTTACTAGATTTATCTCTTTGTGTGTGTGTGTGTGTGTGTGTGTT 
TTACAAAATGACAAAGATTTTA<3TCCTTCTCGTGGAAAGTAGTTGCTAGT 
GGTCAGCAGATACTTGCTA<3TATAAATAAATGAGCATA<5ATCTCGGCTTG 
CAAAGGAAGACAAA<3GGAAAAAAGGTTTTCTTGAACATAATTCCTACTTT 
GTGAAAGAAACTTCTCATTTGGAAATTACATTTTGAAAATAGGTATTGTG 
AATC TTTCC ATTG TGGTTTG TGG T ATAACTATC AAATAACACTTTTTTAA 

AAAGAAAAATCTTAATTTTCTAAGATTTTTAAATACCCTTTTAAAATCAG 
CATTTCCAGCATGGTTTGATTAATTTGTAAAATGTAAGAATATAGTATCT 
AAGGCTACAGAAATGACTCAGTGGTTAAGAGCACTGGCTGCTTTACAGAG 
GACCCAGGTTCCATCC<XIA<5CACCCTCATGACA<5TTCACAGC-CATCTGTA 
TTTCTAGTTCCAGGGCATCTGATGCCCTTCTCTGATTTTCTCCAGTACTA 
GTGACACACAGCATACATTTK3AACAAAACCACTGATACACATAAAATAAA 
TTGTTTTCAAGAAACAATATAGCATCTAATTAGCTTACAAAACTAATTAT 
TTGTTTCTGTACTAATTACGTTTCTATTGGCATGACTAAGGCAACTTATA 
AGAGAAAGCATTTAATTTGGGGTTCACACTTCTAGTCCCTTAGATTCTAT 
GAGCATCATK3GTAG<^AGTGTK3GCAGTAGGCA<3GCA<3GCATGGTGC'I ! GGA 
GCAGAGGCTGAGAGCTCACATTTGATTTTCTACTAGAAGACACA<3AGAGA 
GCTAACTGGAAAAGGCATGGGCTTTTCAAACCTCAAAGCCCCCCTCTAGG 
AACACACACCTCCACCAAGGCCATACCTCCTAATCAAACAGTCCTAGCAA 
CTGAGGACTAACCATTCAGAGATAGATCAGTCTATGGAGGCCATTGTCAT 
CCAAACCACCACAGGCCCCAAGAAAGATTTGTTAGTGAAATTTCAGTGAA 
AACTAAAACAGCATTAGAATTTACCTGGCATAGCCAGCAATGATCTCTTC 
TGTTCAGTGCCACAGATTTCTTTGAGTTAAAACTCAGTTGTTAAAACCAA 
AAATCAAAATGTAATTGGCACTTTAAATTGCTATAAGGGGAAACAAGGTT 

TTCAAAGCCATGAAACCATATTCAGAATAATTTTAGCGAGAGAAATATTT 
TTTCTTTTTTTTGTCGTTTCTTTTTTT^ 

ATTTTATATTATTTTAATTACATATTTAATTATTAACCATT^TGACAGA 
GGGC AAAAGG TG AGG ATCTTCATGG AACTAATATCTG AT AAAGC ACC AAA 
TTCTTCCCAACTCTGGGATGCAAATGACAGTTCA^ 

TGTATTGAAGAAAATTGACAAGAAATGTCATGTCTTAACATAAGCATGGA 
TTTCTTTTAAGATGTAGAATAGTCTATAATTAATGTTTTTCAGACTAGTA 
AGACCT<3ATTATTGTTGTATCTTAAAATCTAGAAt3GTACTAACAATTTTC 
TAATGTGTATTTTTTTTTTCATCAGCAAAACA£3GGATT1KMACA^TTCA 
TCAATGCCTCCAAAAACTTCAACCTCAACATCACCTGGGCCAGCAGCTTC 
CCAGGTACAGACACACCTAGAGAGATGGATTGGCAAGTTTAGTGTA^GA<3 
TTGGGGAAGGAGGCTCTGAA<3GCTGGTGAGTGAGTTCAGA<3CCCAGCTCT 
GCCTCTTAGTAGCCATGGCAGCTTGAACAAGCCATGCTTGAACAAGCATG 
TACAATTCCCTCTCTACCTTA^SGCTACTCAGAGTGAGGAGTCACAGCTCT 
'ttCCTCCAGCGTTGCTGGTTCPtf&TTGGTVGGfiTGGCTQCTC&C'K^TTT 
GCCACCACCTTCCAX5CACTATGACTATCTCTATGTTTGTX3CTTCACAGGG 
GAAAAACTAAAGTGACTCATAGTTTTAAGAAAT<3AAAACTCTTTAA<5GGA 
AGGGGGATAACTCTAATATGTAX3AGGTATTCATACTTTGGGATAACTGCT 
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gtctgtttgcctttgo^ 



^C^ACATCTACAAATATGT^^^ 

^^^AA^AA^^J^^TCTTGCC^AA 0 



GTTTGAGGTTTGCCTTCATC^^^ ^^ A ^ TAC ^ATTATAGA 

actcatgttctta?c^^ 

^TC^AGTCTCAC^^^^ 

gaaagaaggcctgag^^ G ^^^^ gtg gtagcaggtagca 

GACCTTTCTGCTTAGGA^^^™ ACC ^ rcCATCA AGACTTGAG 
^ATGTGCAAAATG^TCCA^^^^ 

, ^ , GGCCCATCAAAATTCAGOT^ G o»^^^^^^^^' A ^ , ^ ■^GAGTAATTTCAC' 
^CTGAGTA<X^GGTGAGTCCrcS^ 

CTC ATACAAAGCGGG AC^ACGCAGOn A ^ G > A ^ A ^ t ^' AG TTC A' r GTGCAGCCT 

atgcagtgtgtccctg^^^^^^^cctcagcact^a? 
tagaaatacacagttttaaS 

ATGp TCTAACCA ^J^™^^^^ 
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GTACTK3TGTAAAAACACATAGCTTTACTATTGCACAGACAAGTGTGGATT 
CTTGTTCTGTGTGTGGTTCATGGAGGCTCTC 

GCATGTGTCCTGAAGGATTGGTCTTCCTGCTATGAC 

GCCTGAACTGAGTCCTAAGGAGACA<^TAGTGGAAATGTTTGTATTGCAA 
AGACAGTATGGGTAGTTGTTTTTA<3AA^ 

GAACTTGTGATCAAGAAGCTAACAGCTGGACTGGGATGTAGCTCAGTTGA 
AAGAACGCTTGTCTAACATTAAGAA<^CCTGGGTACCATCACTACCACAG 
CATAAACTGAGAGTAGTGACAGACTCATGTGTCCCA<3CACTGGGAAGGTA 
GAGGTAGGAGGATCAGAGGCTGCCCAGGGAGGTTCA<3AGTCACTTACGCT 
AGGAGATAGATCTAAAAATGAAAAGGAAAAAGAACTTGGTAGCTGCTAGA 
GCT ACC ATG AAG AG AGTGG AGCTTAAGG ATTC AGCTG AAG AATGTAAAC T 
GCCTTCTGATGACAACTGAGAGTCGCTGAGTTATTTAAAGTCAGGAAGTG 
AACAAAGATCAGTGTTTCAGAAAGACCTCTGTGG 

AAGTAGCCCCTCCTATGTCAGGTACTGGTTTAGACTGTATTTTC 
CCTCTTTCTTGATGGCCCTCAGACACCTTTCATGG 

TGTACCCCATAGCCACACACTTGATGGTTCTTTATTACATAAATAGCTCC 
TTATAGGCAATGATAGATTTTATATTT^ 

ATGTCATTGCATAGAATTTAGTAGTTGTAGGTACTCAGTAAATGTATATA 

GGATGAATACAAAAGCTTTAGGGTAACAGTATTTTGTTGTTCT^ 

CATTTTTAACTATCTCATAGTAGCACA<5ACTAACCCAT7UVCTGACCATGA 

AGCCAAGGATGACCTTGAACTCCTGTACCTTCTACCTCTTCCCCGAAAGT 

GCTGAAGTTACTGGCATGTGCTGCTCACCCAACTAAT^^ 

TATAAAGGTGCTGATCCCCTTTCCC 

AAAGCTCTTTATCCCAACCCACAGTGTTAAAGAGTTTAGTTAA 

GGAAATTTTGTCCCATVATGAAGTGGTTGA'PGGCAGGCCTGGT^^ 

CCTATAATTCCAACACTCAGGAGACAGAGTCAGGACGATGGCCAAGAATT 

C AAGG CCTTGGGCCTACAGT^TAG AAG AG AG AAG AATG AGG ATTGG AAC A 

CCTGATTAT^TAGATACCATTTCCTGCTACCAACCTGTGCCTTAGCTACT 

CTTC T ATTGCCGTG ACAAAACATC ATACCC AA<3GC A<3CT^ 

GCATTTAl^AGGACTCACAGTTTCAAGGGTTATACTCCAAAACC 

GC<TGGGAGCAGGCA<X1^GGCAGGAACATCT^ 

GCTCACTTCTTTATCCACAAATAGGAGGCA^ 

TAGAATGAGCTTT(X^GACCTCAAA<X^^ 

ACCAATTGGGAACTAA<3TATTCTAATCTX3TC 

TTATTTAAACTACK^CACTTTATAAGTTAATACTACATC 

CTGGTATGGGAATTCTGAAAAGTAGT^ 

GTGAGTAGATGCTAGCATGTGTGTCAGGAGTGAAG 

CTGGTTTGACTTCTCTCCAGAGCTGA<X5TC 

AAACCCGTATTAAAGCGGTGGTAGTTACTGAA7ATCAGTGCAGGGCTGTX5 
GTCTCAACACAATGTTTGAAAAAGAAAACA<3<XX1A 

GTACAGCTGCTTATAATTCCAGTCCTCTGGCCTCTGCTCACATG 

CCCCCCC^TACATACAGAGATGATTAAACATAA 

ATGCTATAAAAATGGAAAGAGCCGGGCGTGGTGG^^ 

cagcacttgggaggcaga<ktca<xx:ggatttc^ 

gatctacagagtgagttccactagagct 

tctcgaaaaacaaaaacaaaaacaaaaacaaaaaaaaaa^ 

ggttcactgtttcacaggaaaactctgagaggtgataat 

ttaaaatatactccatagtgcacacagcctc 

ggcctgtgagaagactk!!agtcctctcctggcttc^ 

AAAACTCTTCTGCAAGATCCACAT^GTC^ 



BNSDOCtD: <WO. 



.O005373A2.l_> 



WO 00/05373 

43 / 89 PCT/US99/I6484 

GCAGAAAGTCAGCCAGGTgS^ 
ACACCTCCCTGAGGCTTTGTGGCTTCACCTPirT^m^^ 




GTAAAAGCAAAGTACATATTTTTAAGT^aa»^^3^^^'^ , ^^'^^^^-^'^A 

atatatagaaatctagacJtcag^a^cc^ 
actagctgcttctcttaga(k:cc^^^™^ ctc '^agac 

atcctcactcatatgtcacaag^^ 

CAGCCTCCCCCACAAACcd^^ 

WAATAAGCCATCCACrcTA^A?TC^^ ATATOCCA ^^CT 

CCTGGTTTTGGTTTGCTTCAGtGTA^ 
CCACTCTTTGCTGACCCTCCAPJ^ 

AGGGGACGGAGCAGGCAAACCAGA^^? A ^ CCa ™ CTCC TATTC 




tattttagaa<x;tggata1£<^ 

GATCAGAAGTCCAATGTTA^^^^ 

A ^ aaa ^ gt ™agtata^ 

AGCCCCTAAATTCAGAACACTTWa*^^ 
TTTCAAACTCTGACCAGcS^^ 
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TGGCACTCAACTCATGTCAGTGACCTGAGAGGC^CTTTA^ 

gcaccctgtgcccagcctaaaatttgacctttgcaa<^tttagtgtgtgt 

tatctgactgtctgagtaaggatgacaaaatgaaaccaaacttatgggat 

aaagcttggtggttgtatcagtacatttttattgttgtg 

tgaccaagacagcttatagaagagtttatttgggtgtatagttccagaga 

ggtaagagtctgtcctgacaaggaagctgtggcagcaagtggcaggtatg 

gctacaggagcaggaaggagaaaga*x:aaactagaaacagttga<x3 

ttaataggaaagcccactcccctaatgatgtccttccgctagcagaccac 

aagtcctaaccctccctacacagcaccacca<x:tggggagttc 

tgggactgcaggggacatctcattcagaccacctcagtcggagaatgctt 

gccttcatagtatgtgcaaggccctaggttcaattct^^ 

gaacatgaggaaagaaaagaaggtgggagagagtagagaaagaagagaag 

aagaggaaaaaggaagggaagggggagacagaggaaagcagggaagcaga 

ggagaggagaagagaaagaaaagattaacca<5cctggttttt7^atagcac 

ccctcccactctcagtagttcccaatttgagc 

agatatttctgggtgggtgaccagtgtggtcataaac 

ctctccgtacaacttgtgattatgaacttgttagatgatcagcttcaaca 

ggagagggcctcctttagtctcaggtgccccctccagccacgctgggact 

cgcagcctctctgtgatgagacacaggacattaactggtatggttctgct 

ttgccaaaacgtcagtccatggttgaactctccacaatc 

ttgagaatcattacatggcatca<xx:aagccaggactc 

aaagggccaggagcatccgca<^ttttggcac<x:a<3tactaactagta^ 

agcacctcataggtttctttaaaatgcaaacactaaggt^aaatctaactt 

ttttttatttattaaggccattcattttatttt 

atacatatgtaccacatgcatacaaggtcaaaa<3atagtattgggtcttc 

gaactggaggtacagatgattgtgagctgccatgtggatcctcgaaattg 

aacctaggtcgtctacaagagcaggaagtgctcttaacttctgagccatc 

tctccagctccagaaaagctactcataaaatstcaaatctaa^ 

ctggtgatgtacacctttaattgtagcacatggaaggcggaagtaggcgg 

attcttattcatccaaggccagtcttctcttaa<iagtg 

ccaaacccgaaacctgttactttgcactttagagtataagtgatagagaa 

aagacagagaaattttagaatctataccttaaaatagcttatggcttata 

tgatactgttgggaccatatttacttatc^ 

aaaaaaagatggggggggagctgaa<^tctcctttctat^ 

tctagctat/vaaaagagtaagaggcatgag1x3tgtctcagtggtagagca 

cctgcttagcttgtgtgggattgaatgatc^^ 

gtggggcaataaatttaggaaaataacatgctaatcattgact^ 

tttttttaaaaaaagttattattttatgtttattk3 

tctatgtgtgtttttgatgtctgct^ 

cttcccaattgttatcataactaccatcttc 

acttagt7^ttgtttcattc<3aatagatactgaacactcttaatctg7^aa 

ctaatgctcagaaagttccactttgccaagcaaggaggat^ 

tataattttagcactgggagggtg7^ggca<5aattctg 

cctgagcttttgagatcctgtctcaaataaaattaaatt 

agattttcagaataggtgtgttca<3ctgctgaataaatctaa<k^aaatat 

ccccaaagaaccctgaaatctgaaacgtatta<3ttctaa 

tgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtctgtc 

taagatagattctcactatgtaaccctakxttt^ 

agagagaccaggcatatgctatggtgcc1kkx5agtggcaaacgttttaga 

tgaaagatttcagttgtaccattatcttcctaatgagsgctctggtctag 
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^AT^AGAAAGAATTAGGC^AtS 
CCACAAAATAATGTTACT^^ 

TCCC TAC AAAG CTG CTTTTTC A PTT^^^r^^^ ^ < ^ T ^ ,< ^ ,r ^^ TATA 

gataaaactctccgJt^*^^ 

^TCAAGCAGTTATTAAT^A^^ 

taatctgtcttaatcaa^ 

C^TTTACTTOCAAACTC^ 
TTTTGGTGTTCTATAACTGCCTTArA^a^ 



AGGATCACAGTTCCT^GGTTAGA^TCCATP^nin^^^ 



a ? a ^^a™gataaS^ 

-^TTTTTGCCCTAACAAATTTTTGTTT^ 
TCTGCGT<X:ATGCATTTAT™£^^ 

CCAACCTCACAATTTTT^AA^ 
*<3TT<^^AATCTGAGGTe^^ 
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CCAAGGTTGTCCTCAGATCTCCACACAGATGTACCCACAATTATACAAAC 

ACTCAACATAAACCTATTTACACACCCACATCACACGCACACACATACAT 

GCACATACAAAAAAAiKSCTTTTTGAAAGAAGTAGAGAATGCTAGATATGG 

TATTACACGTATATAATCCAAGCCACTCTGGAAGCTGAGGCAGGAGGATT 

TCAAGTTTGAAACCAGCTTGACCACATAATTATAGCATGCCTCAAAAATT 

GTATAGAGAATAAGAATGAATATGAATGAGACTAAAGTCATATCTCAGTT 

ACTTTTCTATTGCTGTGGCAAAACACCATGACAAAGGTAATTTACAGAAG 

AGATTATTGGGGCATATA^TTTCAGAGGGTGAGTCCATGACAATTATGAT 

ATGGCACTGAAGTAATAGCTGAGA<5CTTAAATCTGGTCCACAACATTAGG 

CAGAGAGAGAGCTAACTGGAAATAGCCATGAGATTTTGAAACCTCAAGCC 

CCACTCCTAGTGATGTCCCACACCTCCTAATGGTTGGCAAACAGTTCCAT 

CAGCTGGGAACAAGATATTCAACATATAAGCCTATGGGGGTCATTCTCAT 

TC AAACC AC C AG TAG TAATTATTAG AGCCCAGC AAAG AAGG AAGGG ATAG 

AAAGAAATGATTGATGGGAACTGGGGTGAAGTCTGATACAGAGAGATCTT 

TATGTACTGCAGCGTAGCTCAGGAAGATAACTATGGTTAAGGACAATTAG 

CTAAGTGATTAGTAGAGAGGATTTTAATATTTCCAATACAAAGAAATGCT 

GCAGGCCTGAAATAGGGTACGTTTCAGTGACCCAGATCTGATTATTACAA 

CTCATACACTTGTACCAACCACATAAATATGTACAATAATTGTGTCAGTT 

TTATATTAAATAAAAATGTGGAGCAAGTTAAAAAATGC<?rGTTTTAAACT 

GATCACAGTTATATGCCAGCTTTTCTTTGCTCTGAGAAAATACCATAGGG 

AGTAGTTTATAAGGAAAGAGATTTCCTCCAGCTCATAATTCCAGAATTTT 

CAGTCTAGAGTCAGTTAGTTCTATCATATTGGGCCCACAGCTAGAGCAAA 

TACAATGATGGGGAGAATGTGGTAAAGAAAAGTATTTACCTCAGAGTGGT 

CAGGAGGAACACAAGACAAAATATACATTTCAGTCCCATACCTCCAGTGA 

CTTGCTTCATCCAAACAGACGCCACCATCCAATAGCCATTAAAATAGAAG 

TCAACCAGTTGATTGACATCCATTGATCTTAGTCATATCCCTAAATTCAA 

CCTCTAAGCTCTGATGCTCTGGGGGCCAAGCCTCTATTGCATAAATCTCT 

GGAGCATATTTCATAATATGAAATATTAAACIAGGTCTCTCAGGAGCTGTT 

TGGTAGACTTAGTTGTTTTTTTTTTTTTTTTC 

GGGTTTTGTTTTGTTTTGTTTTGTTTTGTT^ 

TTCGAGACCGGGTTTCTCTGTATAGOCCTGGtKMTCClK^AACTCACTTG 

TAGACCAGGCTGGCCTCGGACTCAGAAATTTACCTGCCTCCTGCTGCCAA 

GTGCTGGGATTAAAGGTGTGGGACACCA<rTGCCTGGGCTA<3ACTTATTTT 

TTTAATCAGATTTCAGTCTTTGCCTCTGGAATCACAG^ 

TCAACACCTAGTTTACAGAAGAAAGAAAAGCCAATTTTTTTTTTT 

CATTAGACAACTAGAAGTTTTCCCTCCTATTAAGAAAACATATTAACQGG 

CTGGCGAGATGGCTCAGTGGGTAAGAGCAGCGKSACTGCTC^TTCCCAAGGT 

CCAGAGTTeAAATGCCAGCAACCACATGGTG<^TCACAACCATCC?GTAAC 

GAGATCTGACTCCCTCTTCTGGAGTGTCTGAAGACAGCTACAGTGTACTT 

ACATATAATCAATAAATAAATCTTTTTAAAAAAAAAAAAAAAGAAAAAGA 

AAAAGAAAACATATTAACAGTATTGAGAAAACTGT1K3GOTTAAATTTGAT 

GATTTXSAATTTTATTTTACTAATAAATGCAT^TATTGCTGGGCATGGCAG 

CACATCCCAGCACTCAGGATTGCGAGATAAGAGATCATAAGTCCACGCTA 

GCTGGAATAGCAAAATAAAATCTTTTTTTAAAAAATATACATACATACAT 

ACATACATACATACATACATACATACACACACACACACACTTTTCTCAGT 

AGTACGGCCAATTAGTTGACTTGTCTAACGGAGGGA<^AAGAGGAGGCA<3 

AGAGCATGCTGTTCAtSATCACGTTCTCTTTTGCATTCAGTCTXaSGAOCCC 

AGCCCATAGTCTGGTOCTGCCCACATTGATTATTGGTATTCAGTTAACCC 

AGTGTAGAAACTCTCTCAGAGACATGCCCAGATGCTTGGCTCATAACCAC 

TGTGTATGTATATATGCTTACAGAAAATATACTCATCATTACACATAAAT 
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TACCTGTTATO3CTCA^^^^ Am ^WACTTGT?rrT 



TACCTGTGCGGCATCA< 
GGAGGCAGGGGTGGAG' 
rTGCCTTTCAGTTGTT^ 



^TGCACTTCCTGTGAGCC^CCTC^ 

^S^CAAGA^^^S^^^^AC^^ 



catatgagaagcatg^^a^^JZ^agtgatgactaaS 
^^attcaagtc^cI^^ 

^CAAGTTCAGCTCAGTGAC^A^^^ 

^caaacctc^g^ 

AG AG TGGG TA ^TAGT AG AC A^^TGCCA^G^Tn^^nT^^^^^^^^'^'^^ CAC T 

ccaggcactgatccaggtggtotcS^^™ AGCACCTc 'TCcatc 



^AGAAAGTATATA^ 
^^AAAATATTIX^GTATC^ 

accccctcaaaaaagga^™^ 

CTAATAAACTTTTTOTAC^^ 

ACCTTATCTCCTGTCCTCT7^?^^^ TCA ^^ G ^TOACCC 

gactgggtttgctgagttaa^a1T^^ aga ™^ a cagaaccca 
^J^acacatacta^ 




tggtttgaagc^ggccagtttgtctt^ 
? c ^tattattcatggtata^a^^ 
a J^^;P^ t atagtatctc^ 



CATTATATCCTCCACT^AACrTTTATT^T^T^ 
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T AAAAA T A T AAT AG TG TT ACC ACATG T AAC AAAATTTG ATACC TTGVGGT 

ACCTAGCACCTTGTCATGTGCAGTTTTCCTCAGCTGTC 

CTGCATCTGATCAGTTTGAATCAGAGA<5AGTGTAGCATGTCT^ 

GTATTCACTAATAAAATCTCAGTACTAAGCATATTAATAATACTATATTA 

TTCATTAGCAACTTCTTCGGGA<3ATGCAACAGATG^ 

GCTTCTGTAAACC TTGCCTTGG AAAC AG ATG AA<3 AACCTCCTG ATCTCAT 

TGGGGGAAGTATAAAGGTGAGAA<3TGGCTCAAA<^ 

AGAACTCAGGCCTCAGTTTGCTAGGCTACAGACA<3CAA<3CG 

TCACTCCTGTCTCCTCTCTAACAX3TTAGTCAGCA<5AAG 

ACCGTAAGGGGCTCTGTGTGTGGCTTTACTTTTCGAG 

GATTTTAACATGCAAATTAAGCTTGTTATTCT^ 

TTTATAGTTTTTATTTGGAAATATCTAATCTGGGCTAGG1X3 

ACATCTTTAATCTCAGTTCAGAGGAGGCAGA<5ACAGAGGCA<X5 

TCCTTGAGTTCTAGAACA<XTTGGTCTAC 

AAAAAAAGAAAGAGGGGGTGGGGGAAGGCAX3OTAACTOT 

GAACCAACACACACACATTTTGTTCAGAGCCCCAGTACTCAATTAAAAGC 

CAGGCAGGCATGGTAACA<3TACTTAGGGA<5TCAGAAACAGGATTCCCAGA 

GTAAGCAGTCTGACTA<X5CTA<X:AGGAAAT^ 

AGAGGCCCTGCCTCA<3TAAGTAAATTGAAGAACAACTGA<XX5AGACTTGC 

ATGTGCACTTGTGCATGCACCCACACATGCACTTGCACACATACCATATC 

TCACCATGCTTAGACTATAAAATGTAGTCACTACTGGCAGCACATGCCTA 

CAATACAGATGCA<^A<5AATCACTGCAAATTTGAGATCAGCCTGGGCTAC 

TGGACAAGATTTTGTCTCAAGAAAACTAAAACAATAC^ 

GGGGGTTATTCTAATGCCAOTGTTTATGACAGCACATTCAGAACTGACAG 

TAAAGGCAATCAAGGACTGTCAGTGGTGGGTATATACATAGGCAGAGGAG 

CAACTGCTACTAGAAACTGTTTATGCTTTAAAAGACTAATGTA1K3CTGCA 

GCATAGACAAACGTTAAGTTGTGTTAAGTAAAAGATGCTGTATCATTCCA 

CTTACCCATCGAGAATAATCAAATACAAGACAGAGTA/^AATAGTGACTGC 

TAGAGGCTTAAAAGAAAAGACCAGGGGGTGGGGAAA<XX3A<5^3AAGGAA^ 

TGGGAGAGGGAGGAAGGGAGAGAGGGAGGGAGGGA<3CCAGACTTTGTGGC 

TTACAGCATCAAGAGGCTGAGGCAGAAGGGTTACAAAT^ 

TGGGCTACATA<3TGAX3AAGTAGGATTTCCTTGA<X:TC 

TAATCTCTCATTGGGGGAAGTCA<X^ 

GGGAATGCTATTTGCTGGCTCCTTCCCAGGCTCCTC^ 

CTCATTTTGTTTTTACTCTCTATGGGTC 

GTACGATATACATGCCTGCTAeG<^C^ 

AGTTACAGATGGTTGCAGGCriXXrCCTC 

GTCCTCTACATGAGCAAGTCTTCTTAACCATTGJ^ 

ATAAAATTCTTTTTTAAAAATAAAGTCTGC^ 

TAGAGCTGAAGCATTCAATGAGTGGATAAAGAATCGATTTGATC 

CTACCTTTCACAAGCTCTTAACCCCTAC^ 

AAGATGAATGTAAAACA<K3TAGCTCTCTCGATAAT^ 

GCCAGGTGTGCAGAACTGTGCAACA<^TC^^ 

CTTTCCTGAC^CTCACACAGCTCTK^^ 

ATTGACCTTATAAGCACCTGACTGTGCAGTGTA<^ 

TTCrreTTTTCTTCCTCCAGACCGTTCCTAAGTO 

GCTTTGGTAAGAAAGOCGCA<3TCCTCTC^ 

GGACTGGGAGGAATCCCTCCTCCTGGTCAGTCAGGTGAtSTAGACAGGAGA 

CAATGACAGATATTGGTCTG1K3AAGGACTGAGTCTTAG 

GTATAGAACCTGGGTClKK3GCACA<5TGCTTAGTGGTACAGAGCTTTGGTG 
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GCTCCTATTCATCTCCATG^^ 

GGTACACGCCTTATGTCCTArrarTV-^^^^i CAGGCAGTGG T 
TGAGTTCAAGACC^C^ra^ 

ctacatagggaj^c?^ 
cccagattgtcttaaa?^ 

GG ATATTTTG AAAATTA^TA^T^CTACTGRr ^^j^ATG AAAAC ATAAA 

tttcattgaaataatgatac^ 

TTTTCAAACTGAcl^AA^^ 

ctctaccagccaacct^^ 

CATTTATTCTTCAGCATGTTTTTTTTr^^ 

caggaaacaagtaccagtggtgtt^^ 

' I ™ACTATATACTCTTTTTre™£^ 

taaaatttatttactativ™ 

^AGAGAC^ATCAGA^ 

ggaattgaacccaggtc^?g^ 

T '* , ' r ' r 'rcTTTTTGGGTAT^ 

TCTTTTATACATG^rA^r^^ TAACTTACT ACATAGTTTGCCAAT 

a ^agccag^aS^^ 
tttaaac^caaaggcagtatcaIaS^ 

CATATOAATGCCl^CACAGTOT^ 

GGCACACGCTCAGAG<^TAGC^rA^ A ^rS^^ CA ^ CA ^ AGT ^ 
CAGAGCACACGA^TCA^^^^^^CCTGAGG 

™^TGTATAAATAAG^^ 
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G AAATGTAAAAAG AAATCTG AAATTATTTTGT AAAAAAG TAAGC TG CTTC 
ATGACACATGTCCCCTCTTGTGGGTTCTTCCAAGGTCTCGC'FGTGGCCAG 
TGCCCTGGTGGACATTTCTCAGCAGATGCCAATAGTGTACAAGGAGAAGT 
CAGGAGCTGTAAGAAACCGGAAGCAGCAGCCGCCFGCACAGCCTGGAACC 
TGCATTTGATACTGGGGCAGGAATTCGCCCTCACAGAGGGCGTGTGGTCC 
ACGAAGCTGTCTACAGGGGAGGCTGCAGGCAGGAAGCAGGCGTGGGGCAG 
AAGACTGGGGACCCTTGAAGCGTCCAACTCATGTGCATGATCATGCAAGC 
TGTTTTCATGGCTCACCCCTCTGTGTCCAGCATCTAACCTTTTACTTCTG 
TGTAGGAAATAATTTAATTACAAGTCCAGGAATGGTCTGCTCTACTCATG 
GGTGGAGGAGACCAGTGCCGACCCCGTGAGAGCTGAAGGTGATGCTGAGG 
TCCCTTGTGGAAGCCTCTCTTGGGAATCTCAACTGCAGAGGAGCTGGCCT 
CTGTCAGCAGCTGTCCAGCATGGTCCTCTGACACTCCTCAGATGAACTGT 
TCTCATCGG AAGCTTGCTG TCTTTTTAC AAG ATGAGCTTTT ACTCTCTTC 
CAGGAAGTAGCTTTTTTTCTAGCTGAGAATTAATAATGGTCTTTCTCTTT 
GGAAGTCATATCAAAGTATAATTGATGGGGGCCTTGTTTTGTTTTGTTTT 

ggtttttggagacagggtctcactgtgtagtcctagctggcctggaactc 
actatgtagatcaggctggactgaactcacaaagatccacctgcctctgc 
ctcacaaatgctgggataaaaagcatgaaccaccaggcccagcaaagagg 
gctattctaaatgtcaaggtcaatggagttagaatatatataaaaaaatg 
caattgataattctctatagaaacttgattaattttaatccattctttcc 

TTCTCTTTCTCTCACTCTGTCTTACACACATGCACACATACACACACACT 

aagtgcctagactttgaatagatctagcaattggacattagtaagcctaa 

gtttttacatgattgcattcctacattcttgtaaactttaagtaactacc 

attgcagtttgttctttttttaaagtctaatttgcagccaagaaggagta 

attctcaccccaagcaacatctaatagggactgagtgaccccagcccagc 

ctagtgtcactttaggcctgacgtttgagcaaccctcggctcttcccaag 

gcaccagagaatgcacttgctcatgccctgtgcctcttgagcagaaaaga 

gcactgacaactgggacacctggctctgtcttcctacagctgctcgcact 

gacctgtgggaacctgtgggtcatccccaggctgaatggagtacacacta 

gaagagggatgatgcctagcattggggcagcatctgctcagcacatggaa 

agggacctggttccatctcccctgggcaggagttggtccagcctcctccc 

agacccagctggtggctgtgaggaggtggggaatgctaatgagaatgaaa 

agcacatgggttgatgggaagggacaagattaccacgttaggagggtgag 

cagccctcoxx:tatgtgcccaggacccikxx:tggacattgcatttc^ 

tttatggt\^tccgtattctggcattatg<^g<^ 

ctctccttcttcatgtcctacagttctgctatcacctgactagaatagcc 

ctctaggc aacag tgctcaaatgtatg agtttgg ag aagttaacaatcag 

AAGAACAAAAACTGTAGTGTTTCAGCTTTAAATGCAGTGTTGAAGAGGGA 
GCCTTTCTCTAAGCCCTGCACTAACCCACTCCTCCCAAGACTCTTGTQGA 
GTGACAGTTCCAAGCTGAACCATAAATCACTGATGCACAAAACAC'PGCTA 
GAAGGCTCACCTCTCAAAACAOGACTCTTTGCATCACTATTAAAGAGCAG 
AAAGTTCTAGAAATGATCCCAGGCTCATCGCCTATACAGTTAGGAGCTCC 
CCACATCTCTACCAAAACCCAGCACATAAGTATCTGGGTGGTCTAGCCTT 
TCATCTCCGTAAGAAGCCAGGGX3ACTCTTGGCCAAAA<5AAAGAAAGGGAA 
GTTGCACTAGGGCTTGTCCGTCCATAAGGAATTGCGCTCTGCTT1<5CTCA 
AAGGACCAAATTTCTTTGGK^CAAAGAAGTTGCTTCTATGTTAGTCCCATA 
CCCTGAAGTAATATGTACCATGGCTCCCACCTACCTGTTTATGCTCTCCC 
TGCCCCCAGGGAAACTGTTTATTCTTTCAAAAGAAGCAAACAGeGTTCAT 
TTCTGCTCCTCTAATGGAGAAACAGCCAGCTCCCCTGCATGCCTTAGAGC 
CAACAGCTCCCTTCAGGCTTAG AGCAGGGGGAATGGCAGGGATTAAGAGC 
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TGTCATCTAAGCAT-TXTAG^SS^r™?2^^ AGGGTGACACA CGGC 

ACACTAGTTCA^^^^^^^^CTCTTGGTCAGG 

CTACAGTTCCAGATAACCTGACACrcJc^^^^^ 0 ^^^ 
TGGTGGTCCACCTGGTGCACAGAPAaa^^^^o^^^^^^^^^^^^^^G 

ACCATGTTCTTT^S^^ 
ATAAATAGCACTATTTTCCC^^^^ 

TTCCCTACACAGTC^GAG^C^^^^ TCCCCGTCCTG ^TC 
ACACGAAAGCAACCCA^CC^G^?^^^^™^ 

AGCAAAAGGAAAATAC^AA$S?^fAlSS^^ GAAGAC ^ 
^TCGTTCKSGGAATTGGATCAAG^ 

GAACTAATTCTGCCTCTGGAAATCTTTTCTG^ 
C™^TTTTAAATGCAGATC^£^ 
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CACATGTGCAACAGCACACAGTGGAGGTCAGCACAACTCTTGCX3AGTCAA 

TTCTCCCTTACCTTGTAAGACCTAGAATTCCACATTGCCCAGGCTCTGAA 

AGTTAGGTTGGGTCCACACTGGGCCATGGCTGATGAAATGTTGGAAA 

GATAACACCAAACTTTTGCACAGAAAATATTTTCATCT^ 

GGAGTTCACAGGCTAAAGTGTTGGAAGGAACATGGGTCCCTGAGCCACCA 

CTTTCACAAAAACTACCTGATCAAGAAGAACTATTCTGGGTTTCT^ 

taaaattccttcccagagagaaatgtaagcaatgtctgccgcttc 

tcccagcaagaaaccaaggcacjvkttccaccaaagttcact^^ 

gtgagtttattgggcttccgtgcagaacatacatgaggggt^ 

AAGTGTGGATACTCCTCCCCCTAACAATCCACACCCTGAATlAAGCCTTAC 

ccagcagggatgagggcttccccagacccagattgatggtgctgccattc 

catttttccctggcatgcaaagagatagacagaaaaatagattatatata 

atatacacataaattagaaaaatagattatataatatacacataaattat 

atattatatatataatatataatacacagatagattatatatgatatata 

tu^acacacagaaatagggtatatataatatataatacacaaactactcag 

ctattaaaaacagtggattcatgaaattcttaggcaaatg^ 

gaaaatattctgagtgaggtaacccaatcacaaaagaacacacatggtat 

gcactcactgataagtggatattagcccagaagcttggaatacccaagat 

accattcacagaccacatgaagctcaagaaaggaagatcaacgtgtgggt 

gcttctgttcttcttagaggaacaccctcataaagtagtggtggggggtg 

gggggagacagaataggtggtttccaggagaggaggaaaacaggaaaggg 

aataaataacatttgaaatgtaaataaagaaaatacccaataataaaaga 

aaaagaattttgaaacagagggtaaaaaataatacacaaaccaggtagat 

agattatatataatatatataacacagagatagatagatagatagataga 

tagatagatagatagatagatagatagatagatagataggtcaactgctc 

gcccctccactaggtaacatgcagttaaggcagagctgcatacaacagat 

gttagggatactcaggtgagaatctcaggctttgctccatccatctatgc 

tggggtgtaagctgtcaacaagtttagctgggatgatgctttgcaagagg 

gcacagctgaatgccctaagatggtagatgcttggctcaaaggagacact 

acagctctgcatcaaggcaaactaactgagatga<3ggcctttattttcca 

gatctgtatcctggagcatcattcacctgttactacactgaaaacatttg 

gtgttggtttcatggcagatgacaggca<5tgagagaagtacagc7^3ggga 

ctgctagaggtgggggttctgtcagga<x3tgggaggctgtttggtta^ 

acttggaaa<xiaacaagtttttagctagagggagaa^ 

tgtacttgcttgatttcttaaatatcaa^ 

attttgcttgcatgtatggctatacact^^ 

gaccagaggaagtagtgtgagcctctgaaactgaagttacagacattacg 

acttgagtgcctgaaactga^cttggtcctctgga 

tcctaaccactgagctatctctccagccctgaca<5aacatcatgtactcc 

aggctggtctcaaatttgctttatagccaaga 

cctcctgttctctcaagtagtggggttacagk3tctacactg 

tgagcaaatcattacaaattgagttctaaggcaggtgtaatagttcatgt 

agtaacaatctggaattttggtctcttaaaaaa^ 

gtattttcattttaatcccaggtgtatggcatatatot 

ctgactacagcagctatgattttttcttgttctagcaga 

gccagctacagatagtttctgtgattgtgtgacatttg^ 

cttttcagatggtatataaatattagagccccaataggcagagttgatga 

ttgttggtcattcaggggtattggttc 

agaaacaagaacaaattagattcagagatctctatatctctctctatctt 
cctttctgtcctatctagtaata<3ggggtaaaag^ 
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TACATA 
GCTGGAAAAGTTT 



agcatggaaaaagI^aa^^ 
gagttttccaaaga^g^ 

CTGAAAAATACCAAAGGACAATGTGCTCA^TA^^^^f^^ 



TAGCATATGC 
3TGTCCTATA 

GCACTCTCCTTATGTGGAACGg" ATA^AaI?^" " ^"^^GAAT 



GGGTTATGAGATCACCATCA<3CTTTTrcCAG^TTTTrc * m 

GAACAGTAAAACAGAGCAAG^^^TA^A^Atrr^^^ 
AGGTGAAGGGCGGTACTCTAGTCTArA^^S^ GAACAGCTGCT 

ctactatagaacaag^^ 

A^AGCCACMAAAAAG^AS^^S^^r^^^^ 

gagaagccttcactgat^^ 

ACAAAGCCATATCAGACCCTCl^rr^™S^ AGrcTCAGCTATG A 
GTGTATGACAATCCAaISa^^ 

^ACAAGTAATCCC^^A^G^Sa^^^ 0 ^^^ 

gttctaatgtgta™^ A ^ AG J^^^ 



AACC^GAGAGTGGCATGGAGACTCAAAACCTTCA! 



TATTTCAAAGGAfiAA.^VV^^SISI^^^^TATAGATGAA 



-CTATATCA 
:CTTG<3GCA 

ACTATACACX^CAACAGGAGG^G^A^ 
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CAAAGAACAAACAAGAAAGTCATTTGATTCA<5ATGTATCTTTCTTGCTTC 

CCCCACTCTTTCTCTCCTCCCCCCGGCACCCTGCCCCCTGCCC<^AC^ 

TACCCTTCTTTTTCTATCTAGTGACAAGGATC 

GTGGGT^AAAAGAAGAGCCCACAAAGTAACTCAGGTTGGCTACAAGTTCAT 
GCCAAGAATCCTAGGACCTTGTTGTTTAAAGGCT^ 

CATGAATGTTAAATGTACATACATGTTAAGTGTATGTATGTACACCATAT 
G C ATG C AT AC AG AATCC AG AAG AAAG TAC ATTATACCCTGG AATGG AACT 
T AG AG TTGTG AG ACAGC ATGAGG ATGCTGGG AACTGAACCCAGTTTCTCC 
ACAAGAGGAGTAGTTGCTCTTCACTGCTTAACCTTTCCTCCAGCGCCAAT 
CCTAGCATTTTGGAGGCTGATGTAGGAAGATTATCC^ 

TCTTGGGCTCCATAATAAGTTTAAGACCAATCTCAGCTCCAGAGTAGGAC 

CCTGCCTCAAAAACACACAGGTGGAAAGATGGGTGGGCAATGAAGAGCAC 

ACACTGTGCCTCCAGGGGACCCAAGCTTGGGTCCAAGCACCCTTGTTGGG 

CAGCTCACAACTGCCTGTAACTCCACCTCGAGAGGATCCTAAGCCACCTT 

CTGGCTTGGCTTCATGGAGGGAACAGGTATGTGGGTATCTC 

AATGAGCAGCAAGTGAGTCTCGCTGTGGCTAGCACAAAGTATGGGCTGAA 

GAGCAGGAGGACAGCTGAAAAGTGGCCTTTCCTGGTGAGTAAGTTGGTCT 

GAGCAGCTGAGTCAGTTTCTTCCTGGCTGCTTGGCTGGTC 

TAAGCTGCTCACTTGTAAGTCTTTTGCTAGGAGCGCAGCTTGTCTA<XX3G 

TTGTCTTTGCAACTGGCCTTGTCTGACAGTGACTTTCAGCXGTCTTAGCT 

GCTTATATACACAGTCTTAGGAAAGAAGGCTGGTG/^TCTGATCCATTTC 

AGGAACTTTCTGAAGCTATTCTGAATTTACTTTACAAGGTTACCTGCAGG 

ATAGAGGATCTCAGCTCTTTATAAACATCCTGTCCTT^AAACACCCTGTTG 

TTCCTCTTCTCTTTTACATCCTGTGTCTTGAGAAGTTTGCCTCCAKSGAIXS 

GAAGTTGTTCAAT1XTAGAGGACACTGTTGCACAAGCTCCCAGCACCCACA 

TGTGAGCTCAGTGCTCTCCTTGGCTCTAGCTCTGCCCTATC 

ATTTTGTCATCATAATCTTTTCCTATATCCTTCCTTGTTCTG 

TCTGGTTCATTTTTTTGGCATTTTC 

GGCTGCCTCCAAATCATCTTTTTGCCTTAA 

CGAGTGGATCTTAACACTTGACTGACTCGTTTAAGTGTGAGGAAATGTGG 

ACCAATAAGAGAGCCCAGGAAAGCCCAGGAGAATCTGTAGCCCCATGGCT 

GTTGTGTCAGAACC<IAGAGTTTTGTCAACAGAATTTGGTTCCTAAT^ 

CCACTTTATAAAAACGAGTGAGAGAAACAGGAACCTATTCAGATCTGGGG 

TCTGA<5CAATCAGTGGGTGAACATCTAGAGATCTGTTCTTC 

CCAGCTGGCAGAGCATGCGTAAGGCGGGAGGGAACAAGGGCAATCACTCA 

CTCTGGGGCTCAGGCTTGCCCGTTGGGTCAGGTGT^ 

TGTCTGCTTCTCTTGTTACCATCCCTCATC 

ctacttaccaatttcactgqccagtgtccatatttcct^ 

t\mtttaatgagcttgactatgcccg 

gggcaacga^sgcagttiagtggtttccagaaccact^ 

tgggccccaagccagaggaacgtgcatgagccatgaa<5tttccactgata 

aatccacagatgcttctagcacctgcctttctgactcagcctc 

cgcctgccagctgtgaaatcagtgccaacaacaggtaagggagacgcagg 

cgcagggccaggagaggtgtctgacacttccagacaggatgtggaggctg 

acagttgtgaixxsagaggagatggggaggacagagacgggctct^gcttta 

agacaccgagccacaga<x:a<x:aaacaaaagccagggc^^ 

aagtaacagaaaccaaaca<3gcaattctacta<3tt^ 

ctgcatttgccaatcttggtagttttaaaaaacaaaaaca^ 

agcactggcagagctttcctcctctgga<xx:t^^ 

ctctgtggtacactggcttcagacatatctcttgcctatggctggctgac 

Fits,. 6s) 
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ACTTTTAAAAAAGTCTCTGTGGTTAAAAGCATT^CA^^ 



CCTCC 
CCTC? 
AAATG 
CTTTI 
TTTGT 
CCGAG 
TTCTC 
kCCCT 
TGATC 
PGACC 
TTGAG 

CTCCAGAGATC^CTATA^^ 



JAG 



^^? G ^^ GTC ^^CTCTCTCTCTCTTTTTCTTTTGGTCTTTT 

\CTCACTTTGT 
rGCCTCCCGAG 
3TGTCTTTCTC 

TCAGTGGGGCCTTTTCTAGCTACTGAGC7 ~ — — - — 1 u A ^ CCACCCT 



lt=f==I 



CCACTG 



JCCCCCAATCCAGAG 
AGGGAGTTTCAGCTCAGTGAGCAGTCA^PCGGG^C^^^ A 



F I 6> . *>T> (ZC) 
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CCCCCAGAAATCTGGAACTGAGCCATTCCACCCAGAGGCAAGAACAGCAG 
AGGTAAGTTGGGCACACATGG7UVGAAAGGGCCACCCCATTAGTGTCAAAA 
GGGAGGCCAACTTCAGGCCATTGGACACGTTTTAACGCTGACTTCCAGCC 
ATGTACCATGGCATGTGCACACTGTCCATCGCCCACACGAAACATGATGC 
GACGTAAATAAGACCCACGGGCCAGGCAGCTTGGATTGGGCCACAGACAT 



Fi £, (xi) 
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Exonl CeIegE106 
CelegE108 

Exon2 CelegE33 
CelegE36 

CelegE37 
CelegE40 

Exon 3 CelegE41 
CelegE44 

Exon4 CelegE69 
CelegE72 

CelegE73 
CelegE76 

Exon5 CelegE77 
CelegE80 

CelegE81 
CelegE84 

Exon6 CelegElOl 
CelegE104 

Exon7 CelegE109 
CelegElll 

CelegEll4 
CelegE116 

Exon8 CelegE86 
CelegE88 

CelegE89 
CelegE91 

Exon9 CelegE93 
CelegE95 



TCTCCTAGTTGTAGTACATGCT-GTTG 
AGGTCCTGTCTCAAGAAATAGCAATAAC 

TTTGAAGGCCGCTGAAGTCAGAG 
TTGAGTCCCCATCATAAACATATAAATGG 

TTCT AGGCCAAAT AG AAT AATG AG ACTTC 
AGAACTAATrGCATGAGATGAGTGTG 

TGAAGTTOCTGTAATCTGCTCTCTG 
AAGGAGCCTGACTAGAAGCCTC 

TAAACTCCCTACAGTTCACTAACTCAG 
AGCGCTGTTCAGlXJTGAATGTTCTG 

AAAGCCACAGTrGT<nX3TACAGTGAG 
AGGTCTGCATTAGTTGCAATGTTGC 

TATACACCCCCTTATATACACTCAG 
AGAGCCTCTCATAAAGCTGTGGTC 

TTGAACATATATCCGCCAACAACCC 
CTTGGAATACTATAAACTTTCAK3GCTGC 

TAAAGCAACA<K3AAGAGTTGAACTTCITG 
TCCAtZCCTGTGTGCACATGG 

TTACGGTGTCCTAATAATAAGGGCAG 
AATCATGGGTAT1X3TTAACTOGGAAAGC 

T<rrAACAATGT<7rGGCG AGTGTCC 
TCTCTCTCCAGOCCT AG AGTTG 

AGAAGAGGAGCCTGGAACATTGAC 
TTTGTTGGCGCTGAAAGCXrrTG 

TGGCCACAGTAGTGTTTATGATGAC 
TTAATO\ATT<XCTCT<3CAi3AriCTAG 

TG<3CTTACCTATAGGGGGAAATCAAG 
TTGTGTGTGTTGGCTOCAAACACC 
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Exonl'O 
Exonil 



Exonl2 



CelegE98 
CelegEl 00 

CelegEl 17 
CelegEl 20 

CelegEl 21 
CelegEl24 

CelegEl25 
CelegEl28 

CelegEl30 
CeIegEl32 



GGACCATTCTTAAGGACAGCCGAT 
ACATAGTGATCTTTCCATCAGCAAAG 

TGAATGCACAGAGACCCTCCTG 
CCTCTTACCATTCAGATACTGTTAGG 

TTCTTCAGTTGCCAACTCCCAGG 

AAGCTGCTTGTGTGGCAGCAG 
AGTAAGGTG AAC AGG AAAGTACAG AG 

CCCTACACTCACACTCATCTAGC 



Exonl3 


CelegE30 
CelegE32 


CCCTGTGTTCCAGATCTCCATTG 
1 i ACL 1 1 GATCTG AG 


Exonl4 


CelegEl 4 
CelegEl 5 


AGCACCTGAATTCAAATCAGGATGAG 
AAACCAAAGTrCTGAACACAT¥AACTCAC 


Exonl5 


CelegEl 7 
CelegE20 


CTGGTTGCATTCATAGCTGTGTTTC 
ACAGAAGCCAGCATCACTGGG 




CelegE2l 
CelegE24 


TTACTGGTGCTGGGAGGATATGTC 
ATAAGTACTTCATCACCTCAGCGCTC 


Exonl6 


CelegEl 
CeIegE4 


TTGATCTTAGCTGACCAGTGTCTC 
TCTGCATGGACTTGAGCAGAAAGTC 


Exonl7 


CelegE6 
CelegES 






Gele g E9 
CelegE12 


TGTACCTGCAGCCATTGCTTGG 
GGATCTGGGCTCTAGTTTATGTACG 


Exonl8 


CeIegE25 
CelegE27 


T^^ CTATAGGC ^CAGACAGCTC 
A ACTTG ACCTGTGTG ACTT ACGC 


Exonl9 


CelegE193 
CelegE194 


TCACAGTCTATGGTAATCTGTCAAGC 
AAGGGCAACAATGCCCTGGCAA 



Ft*. 5 60 
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Exon20 CelegE195 
CelegE196 

Exon21 CelegE197 
CelegE198 

Exon22 CelegE199 
CelegE200 

Exon23 

Exon24 CelegE203 
CeIegE204 

Exon25 CelegE205 
CelegE206 

Exon26 CelegE207 
CelegE208 

Exon27 CelegE18l 
CelegE182 

Exon28 CelegE171 
'UTR? CelegE172 

CelegE173 
CeIegE174 

CelegE175 
C.elegE176 



CelegE161 
CelegEl62 

CelegE163 
CelegE164 

CelegE165 
CelegE166 

CelegE167 
CelegE168 
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TTCCrGCAAATGGGATAGTCTCTCTG 
ATCCCCCAAGCATTTATCATTCTCAG 

TGTGTTTCCAGAAACCTGCTTTAGTTTG 
TAGTACTTTTGTCCAGGATGACCAAG 

TGACAAGAAATGTCATGTCTTAACATAAGC 
TTCAGAGCCTCCTTCCCCAACT 



TAGTCTGTAGCTGAGGCCATTTTGC 
AAGGAAGCTGCAGTTAAGGGACTGT 

TTGGGACCTTGAGGATTGTTOCC 
CACTCAACAGGTAAAAGTGATCTGCC 

TGCATCTCATCAGTTTCAATCAGAGAG 
AAACTGAGGGCTGAGTTCTGAAAAGG 

CACCAAAGCTCTGTACCACTAAGC 
TGACTGTGCAGTGATGCAGGG 

TTGACCTTGACATTTAGAATAGCCCTC 
GCTGAGAATTAATAATGGTCTTTCTCTTFG 

TACACAGTGAGACCCTGTCTCC 
TAGCTGAGGTCOCTTGTGGAAG 

AGTGTC AG AGG ACC ATGCTGG 
CTTGAAGCGTCCAACTCATGTGC 



AACTCATACATTTGAGCACTGTTGCC 
TGAGGAGGTGGGGAATGCTAATG 

ACATAGCAGAGGGCTGCTCAC 
ACTGACCTGlGGGAACCTGTG 

AATGCTAGGCATCATGCGTCTTCTAG 
AACATCTAATAGGGACTGAGTGACCC 

TrCTGTGGTGGdTGGCAAGAG 
CACACATACACACACACTAAGTGCC 
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CelegE169 
CelegE170 

CelegEl45 
CelegEl48 

CelegE150 
CelegE152 

CelegEl53 
CelegEl54 



TGGTAGTTACTT A AAGTTTAC AAG A AT(~T Afr 
AAATGCTGGGATAAAAAGCAT^AACO^ 

ACGACACTGACCTCTTGCACTTG 

IS^^ CCCTGAATGTCT GAACATTC 
GCGTTCATTTCTGCTCCTGTA ATGG 

TGAGCTCTTAATCCCTGCCATTCC 
TAGGGCTTGTCCGTCCATAAGG 



CelegE157 
CelegE158 

Ce]egEl59 
CelegEl60 

CelegEl41 
CelegE144 

C.elegEl29 
CelegE132 

CelegEl34 
CelegE136 

CelegE137 
CelegEl40 



TGTTACGGAGATGAAAGGCTAGACC 
TAAGCCCTGCACTAACGCACTC 

TGTTTTGAGAGGTGAGCCTTCTAGC 
CATGTCCTACAGTTCTGCTATCACC 

TCTCTAAGGTGCACTGTTGTGGCT 

J GGA 4 G f^ GACT CTrGAGTrGc 

GTCTGC ATTTT AAATGCAG ATGTG AAGC 
CGAAACGCACGCACATTnTACCAr 

ctgtgatttagcatctgtcgcaotS 
tccagagtacagggagaaactaaagg 
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AGCGCTATTCAGCTGTGCCTCCTTTGCTGTCTTGGCTCCTCCTGGAGCACTAT 
ATGCACCCATGTCCTTACCAGGCCITTCACAGACGCTGCCATT<3A<jACKjGT 

TGATGCAGGTTGCAGCCTTTAATCCGCGAGTACTAGGCTCTGACAAGATGCCA 

CAGAAGCCAGCATCACTGGGCTCAGATGGCATCCACTGCAGCAAACTATTTn 

TGAATGGAGACATATCC 
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GAATTCCGGGCGAAGGGGAGC(X3GCGTGCGGGGTGT^ 

GCGGTGGCGGOXKXSGCGGCGACTGAGGCGCGGC^ 

GCAGGCAGCACCGACCXTTOCACCGCGAC^GGGGCCTG^ 

GTCGCGGGCGCTGCCCCCGCCGCCGCTGCTGCCGCTGCTCTT^ 

GCCGCTGCGGTGGCGGCGGCGGTGTCCGGCra 

GCCXXrroCAACCCTGGCACX^3GCCAGTGCX^ 

CTTCAGGACATCTGTCTCACGCCTATAAT^ 

GCTACAGAATAAGTTCAAGAGTAACCTGGGGCAA 

AGA G ^lHr r TTTKSGGAAAATTTTAGCTGACTAATTTTTCACCGAGAACTAA 
CTGGGAATTATAAATATAAGACGAAGTGCAC^TGGCT 
CCATTTTGCTACAGAATGTAGCTGGGACCATTTATATGTTTAT^ 
TTTAGTGGCCTCATTGTTCXrTGAAAGAGATGGC^ 

TGC AT'rTTTTCAGTG ATGCTGCTTT AT AATCTG ACTGG ATTT AAT ATC ACTT ACAATTTTC AC^TGTGTCGG AAT AAT1K5 

CTCAGGCCGAGGAGAGTGTAAGAGCAGTAACAGCAGCAGOG CT \5TTK 5 AG^ 

TCCTIXnXSACATTCCrrcACTGTACAGACAACT^ 

GCTCCTGCTTTCCTCACTGGCAGGGTCXr^ 

ATATTCTGATTTAAAGCTTCCCAGAGCCTCT^ 

ATGTTCAACCATTCAGATTACAGCATGGTTTTAGCGTATGACC 

TGAACAGTGTGGTTGTAAGATATGGTCATTCTTTGGCATTA^ 

TTCAACAGGGAACGTGACCAATGAGCTGAGAGTATTTCATAT^ 

AAGGATCAGTATGCAGTGGTTGGACACTCAGCACACATTC 

TCGGTCATTGCX^J^JCTCTATGGATATATAAGCGT^ 

TACTCAGGGTGCTCTTGTGCAAGGGGGTCAT^ 

GGTGGCTACAAGGCTTTCAGCGCCAACAAATACCGGCTTGCAGATGAOCT 
GGACCMTCTTAAGGACAGCCXIATTTTTC 

AGGGAACACACACAATGACACTTCCATGAGCCACGGTGCCAAATGCTTCTGCTOGGACTTCATG 
TGTGACCGATGGTCAGTGCTTCCCAGACCTKSA 

ACAGCACCATGTATGTGTTCGGOGGCTTCAACAGOCTCCTOCTCAGTC 

TGCACACCGCAGTGAAGCriXXrrrGTGTGGCAGCAGGA 

ACCTCCTXXXSAGTTGGCAACTGAAGAACAAGCAG^ 

ACAGATGTGACCAGCACACAGATTGTTACAGCTGCACAGCCAATAGCAA 
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«^RF WKATECSWDHLyvYDGDsr 
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1 41 81 121 1G1 201 241 281 321 361 401 441 483 



Transmembrane Segments Predicted by MEMS AT 



Start 


End 


Orient 
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22 


38 


out— >ins 


1.7 


50 


67 


ins— >out 
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out— >ins 
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Signal Peptide Predictions 



Method 


Predict 


Score 


Mat@ 


SignalP (eukaryote) 


NO 



Ft*,. 8 £- 



BNSDOCtO: <WO___QpOS373A2„t. > 



WO 00/05373 



67 / 89 



PCT7US99/16484 



GAATTCCGG. 




Pis. <? a 



Fi 6. ^6 
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»«« 2.1 1.7^8.5 
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Signal Peptide Predictions 



Method 


Predict 


Score 


Mat@ 


SignaiP (eukaryote) 
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CCT ACATT^TTTGT AAAATTT AAAT AGCT ACCATTGGCAATCTGOl "I "1 1 "l'l'l CI ' AAAA TOPG ATTTGCAGGCAGG AAAG A 

ATTTTCTCACCCAAGGAACATTTGATCT AGCAGCAGGG ATG AGAGG AAAGCAG AAATG AATG AACTGTGAAA<*CTGCTG 

TTTTTATTATCAAAAAGGACkCTGTCAAGA^ 

CAGAGGAAAGGACTCATTCATCTCACGCTTCC^ 

AGCATCTGTGTGTCXrn^CAGCCTC^ 

ACAGCCTCXX^' ri^ ^GGCAGAAGCTGCCC^ 

GACCCACCCACCCTCXrCTCTCAGACXIA^ 

GGAAAAAAATGGACG AGGAGGGAAAACTCTGCCAAATGG AAAATG ACCAAATTT AAG AGGGTGGG ACAGTGCCCTGCTC 

CTCTCCCAGAGGGCACTGCTTGGAAATTCrrCTrTTTC^ 

CX^GAAGCTCnTTTCTGCTTCAA^ 

AGTTTGTGAATTTGGAGAG AT ACTCAAAAG AGCT AAAACTGCAGCATTTT AC<TTTT AAATGCAGTGCCT AGA 

TATTGTCTCTTCCCX^AC^CTAACCX^^ 

AACACTATCTGATGCACAGAACACCTCTACTT^ 

CCAGACGTTCTAGAAAAGACCCCTCCTCTCATGA 

GTGG ACTGGCCCCTTAATTCCXZACAGGCCCCCCCAGCAAGGCCAAAGGG 
AGATCCTCTTTGTTTGTTCAAAGGACCAGTTTT^ 

AT ATCATGCACCATG ACCCACAGC CATCT^XTI^ ATGTCTT ATTTTT*TTCCT AAAAG AT AATGTTT ATTTTTAAAAAGG A 

AGGAAGAAGCAAGTGAAGTTTCATTCTXKrrcC^ 

CAGCAAACAGC n "l»rC M IK:CX3GCCTCAGGG<^ 

CCAAGAAGGAATTGGTTGTCATCTGGCAGTGTTGC^^ 

CACTGACCTTGCACTTGTACATAACTATACAGT^^ 

AATCTTCATGTAl'rrrrATTAAATATAAC^ 

GGTTCTCGCCAGGCXTCXX^TACATGAATAACAJA 

GCACCAAl^AGGTATTTCTTAAAACAGGACTCATCTGTCAGAGTGCA 
GCGCTGGAGGAG ACTCAGGAAGCAGAGGCGTCXXrEK^Ca 
AGCCTeiTCGTGCnXJ' l XXXrrACTGAG^ 
CTAACCTGTAGCAATCACA Cn ' rK ^ 

CTGGAATCAGTTTATTATACTGAAAACTGGGGGTGGGACT 

TGGAGAATTTX^C^TACCCTGGACTCCnHJ'X 

CXXrTGTGGAGAGAAGGCAACCCCAGATGCCX^ 

AAGGAAAGTACTGGACTACCCCTGGGTAAjGTGCTGOCATTCAAGACTGGAJG 
CTGCTGGTGGGAAGAC5GCATTTTACCTTCCAGTGCAAATCCTG 
CTX^TTCACTTCCTTGGGAGATGGTGGTCT 
CAACAAGTATTTGCTAAACACTAACTTAAOCT 

ACAAAATCCAAGTCCTCACACCGCTCTPCATOCCAGGAGATCTTTCCTTK*^ 
AGGAC^CAGCCAGGACGGCAGAGGCCTXXTT^^ 

CAAACTTGGCTCCAGCCATTGCGGTX3GTTTCTAGATAGGCAGGGK3CACC 
CACCXrrGCCTACAAGGACATGTl'TTGAAATGGAGATC 
TTTX^CTAACTXSAAUn n xri XX ^CAAAACATTCT 
A Wm'l'lUl i rr A TACTTATTTACCATTTC Xmnx^ 
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^CNKKTSCRSCALDQNCQWEPRNQECIALPE^^^ 

VE^^QLRIMQSSQSMSKLTLTPWVGPSGRXNVSYX^ 

I^TCIOPTXKWSW.GUJTTVLRQCRTPCAI^TACGIXrrSGSSEXH^ 
rcPPl^CSGYCTCSHCI^QP<X*3V^^ 

CTACCCNGHSKCINQSXCEK^^ 

LCEVENRYOGNPlJlGTCyVTLLIDYQFTFSLSQEDDRYyTAINFVATP^EQNRDLWlFI^ 

oageempwsktnikeykdsfsWdfi^pnitftvwsnftwpikiqiafsq^ 

AAVVTOXKQSCWASRJ^OI^EHOQMASRPFASVNVALI^EEPP^^ 
PRGLGG I PPPGQSGI^VASAI^VDISQQMPXVYKEKSGAVRNRKCXJPPAQPGTCI 
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10 20 30 40 50 60 

inputs MVAVAAAAATEARLRGSTTTTAAPAG 

KVA - - AAAATEARLRRRT AAT AALAGI^G GPH 

10 20 L {j 30 

*L, I? < \' F 

"70 80 90 100 110 120 

inputs PPPLLPLLFSLLLLPLPREAEAAAVAAAVSGSAAA£^ 

CVNGGRCNPGTGQCVCP 

40 

130 Clfi 140 150 160 170 180 

inputs TGWVGEOg^CGGRFRLTGSSGF^ 

AGWVGEQCQHCGGRFRLTGSSGFVTriXSPGNYKYKTK^ 
50 J L 60 70 80 90 100 

190 200 210 220 230 240 

i npu t s S WDHLYVYDGDS I YAPLI AAF SGLTVPERDGliETAPEVTVTSGyALLHFFSDAAYNLTGr 

SVTDHLYVYDGDSIYAPLVAAFSGLIVPERDGNETVTE\A/ATSGYA 

110 120 130 140 150 160 



250 ^260^ 270 280 290 ^^300 

inputs NIT2jN§pMCPNNCSGRGECK£S 

:::::::::::: .:::::::::::„ 

NITYSroMCPl^CSGRGECKISNSSimnECECSENWKGEACDIPHCTDNCGFPHRGICNS 
170 f ' 180 ^90 200 -blO 220 

310 320 330 ^-J^Io^ 350 '-^^SO 

inputs SDXRGCSCFSIXtfQGPCpgyT^ 

SDVRGCSCFSWQGPGCSVPVPANQSFWTREEYSm^PRASHKAVVNGN 
230 240 250 260 270 280 

370 380 390 400 410 420 

inputs NHSDYNMVIAYDI*A£REWLPLiNRSVNN 

NHSDYNMVTAYDLASREWLPLNRSVN^^ 

290 300 310 320 330 340 

430 440 4S0 470 480 

inputs RVFHIKNESWVLLTPKAKEQYAW^H^AHIV^ 

........ 

RVTHIHNESWVLLTPKAKEQYAVVGHSAHIVTLKNGRVVMLVIFGHCPI.YGYIS 
350 360 370 380 390 400 

490 500 . 510 520 ^ 530 540 

i npu t s LDKNTWSI LKTOGALVQGGY^t^SVYI^RTRALYVHGGYKAFSAlWYRIADDLYRYDVt^ 
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LDKNTWS I LHTOGALVCK^YGHSSNTTDHRTRfcLYVHGGYK^ 

410 420 430 440 450 460 

550 560 ^^70 580 590 600 

inputs QMOTILKDSRFFRYJ^^AVIVSCTI^ 

QMVrTILKDSRFFF.YUrrAVTTVSGTHL^ 

470 480 490 500 510 520 

610 620 630 640 650 660 

i npu t s N^PRPDI^DVNRF^^AVLHNSTMYVFGGFNS LLLSDI LVFTSE^TDAHRSEAACUUVG 

VLPRPDSTMMSTDUvI PAVLHKSThrTVTGGFNSLJuLSDIIjVFTSEOC-^ 
530 540 550 560 570 580 

p («_>U — / 

670 680 690 700 r 710 720 

inputs PGIRCVVWTGSSCKTISWU^TDE 

, ••♦* , ** ,, ;;;;i;itiJ!iJl!!J!l!l 

PGIRCWNTGSSQCISWAI^TDEQEEKLKSEC^ 

590 600 610 620 630 £ 6 ^j^ V C^o^ <J**1- 

730 740 750 760 770 ^*780 

inputs WCNDHCWRNHSCSEGOISIFRYENCPKDNPhTTyCNKKTSGR^ 

VJ^KDHCWR>mSCSEGCI SI FRVENCPKDNPrT^ 

ZS0 660 670 680 690 700 

790 800 810 820 830 840 

i npu t s AlJ? EN l^G I G WH L VGN S C LK I TT AKENYDN AKL FCRNHNAI*l*AS LTTQ KKVEFVUCQLRI 

P ET J I C G I GWH L VGN SCL.KI TTAKENY DN AKLFCRNHNAL.LAS LTTQKKVEFVLKQLRI 
710 720 730 740 750 760 

850 860 870 880 890 900 

inputs MOSSQSMSKLTLTPWGLRKINVSYWCV^DMSPFTOSLliQVWPSEPSDAGFCGILSEPST 

MQSSQSMSKLTLTPWGLRKIKVSYWCVreraSPF^SLI^WMPSEPSDAGFCGILSEPST 
770 780 790 800 810 820 

0 U**~ ****** 

910 920 930 1 940 ' 9S0 960 

inputs RGLKAATCINPJ^GSV^RPANH^QCRT 

RG LKAATC I N P LNG SVCERP ANHS AKQC RT PCAL.RTACG DCTSGS SECMWC SNMKQCVDS 
830 840 _T 850 860 870 880 

970 980 990 1000 1010 1020 

inputs NAYVASFPFGQCMEWYTMSTCPPENC 

NAYVAS F PFGQCMEWYTMSTC P PE2*CSGYCTCSHCIrfEQ FGGGWCTDPSNTGKGKCXEGS Y 
890 900 910 920 930 940 

1030 1040 1050 1060 1070 1080 

inputs KCPVKMPSQAPTGNFYPQPLI2*SSMCLEDSRYNW^ 

KGPVTQ^PSOAPTGNF^PQPLliJSSMCLEDSRYNWSFIHCPACQC^GHSKCINQSi^ERCE 
950 960 970 980 990 1000 

1090 1100 1110 1120 1130 1140 
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inputs N L TTG KHC ETC X SG FYG D PTNGGKCO DC KCNG HAS LCNTTTTG KC FCTTKC* V K(i UiCCU i^C t. 

NLTTGKHCETCI S G FYG D PTOGGKCQ PCKCNGHAS liClTITrPGKC FCTTKGTVKG DECQ LC E 
1010 1020 1030 1040 1050 1060 

1150 V160 1170 1180 1190 1200 

i npu t s VENRYQGNPLRGTCYYTLLIDY QFTFSLSQEDDRYYTAINFVATPDEQNRDLDMFINASK 

VTNRYQGNPLRGTCTYYTLLIDYQFTFSLSQEDDRYYTAINFVATPD 
1070 1080 1090 1100 1110 1120 

1210 . 1220 1230 1240 1250 1260 

inputs NFNI^ITWAASFSAGTQAGEEMFWSKTNIKEYra 

••»•••••••••■•••••••••••»••••••••*•»•*■•*"*■••*•••••*•*•*••" 

KFNLIUTWAASFSAGTQAGEEMFWSKTNIKE^ 
1130 1140 1150 1160 1170 1180 

1270 1280 1290 1300 1310 1320 

inputs WPIKIQIAFSQHSNFHOLVQFFVTFFSCFLSLIXVAAVVV^^ 

* • - - " 

WPIKIQV QT EQ 

1190 

1330 1340 1350 1360 1370 1380 

inputs QMASRPFASVNVAIJS^DEEPFDLIGGSIKTVPKPIAL^ 



1390 1400 1410 1420 

inputs IPPPGQSGLAVASA1-VDISQQMFIVYKEKSGAVRNRK0QPPA0PGTCIN 
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ATGGTGGCGGTGGCCGCAGCGGCGGCAACTGAtJGCAA<^CTGAGGAGGACGACGGCGGCGACGGCAGe<;CTCGGCGGCftGGAGCGGa;GGCC 
GCACCGACCCTGCACCGCGACAGGGGCCTGGAGGCCGGGACCGCGeGCCCGGCTGTGTCTTC 

CGCCGCTGCTGCCGCTGCTCTTTTCGCTGCTGCTGCTGGGGCTGCCCCGGGAGGCCGAGGCCGCT,GCGGTGGCGGCGGCGGTCTCG<JGCTCG 

GCCGCAGCCGAGGCCAAGGAATGTGAOCGGCCGTGTGTCAACGGCGGTCGCTGCAACCCTGGCACCGGGCAGTGCG 

GGTGGGCGAGCAATGCCAGCACTGCGGGGGCCGCTTCAGACTAACTGGATCTTCTGGGTTTGTGACA^ 

AAACGAAGTGCACGTGGCTCATTGAAGGACAGCCAAATAGAATAATGAGACTTGGTTTCAATCATTTTGCTACAGAGTGTAGTTGGGACCAT 

TTATATGTTTATGATGGGGACTCAATTTATGCACCGCTAGTTGCTGCATTTAGTGGGCTCATTGTTGCTGAGAGAGATGGCAATGAGACTGT 

CCCTGAGGTTGTTGCCACATCAGGTTATGCCTTGCTGCATTTTTTTAGTGATGCTGCT7ATAATTTGACTGGATTTAATATTACTTACAGTT 

TTGATATGTGTCCAAATAACTGCTCAGGCCGAGGAGAGTGTAAGATCAGTAATAGCAGCGATACTGTTC 

AAAGGTGAAGCATGTGACATTGCTCACTGTACAGACAACTGTGGTTTTCCTC 

CTGCTTCTCAGACTGGCAGGGTCCTGGATGTTCAGTTCCTGTACCAGCTAACCAGTCATTTTGGACT^ 

TCCCCAGAGCATCTCATAAAGCTGTGGTCAATGGAAACATTATGTGGGTTGTTGGAGGAT^ 

CTAGCGTATGACCTTGCTTCTAGGGAGTGGCTTCCACTAAACCGTTCTGTGAACAATC 

CAAGGATAAAATTTACATGTATGGAGGAAAAATTGATTCAACTGGGAATGTGACCAATGAGTTGAGAGT 

GGGTGTTGTTGACCCCTAAGGCAAAGGAGCAGTATGCAGTGGTTGGGCACTCTGCACACATTGTTAGA^ 

CTGGTCATCTTTGGTCACTGCCCTCTCTATGGATATATAAGCAATGTGCAGGAATATGATTTGGA^ 

CCAGGGTGCCCTTGTGCAAGGGGGTTACGGC<^TAGCAGTGTTTACGACCAT^ 

TCAGTGCCAATAAGTACCGGCTTCCAGATGATCTCTACCGATATGATGTGGATA^ 

CGTTACTTGCACACAGCTGTGATAGTGAGTGGAACCATGCTGGTGTTTGGGGGAAACACACACAATGACACA 
AT G C TTCTC TTCAGATTTCATGG OCT ATGACATTGCCTGT6AC£ 
TTGGCCATTCAGCAGTCTTACACAACAGCAC{^TGTATGTGTTCGGTC 

G AACAG TG TG ATGCGCATCGG AGTG AAGCCGCTTGT TT AGCAGCAGGAOCTGGTATTCGG TGTGTGTGG AACACAGGGTCGTCTC AGTGT AT 
CTCGTGGGCGCTGGCAACTGATGAACAAGAAGAAAAGTTAAAATCAGAATGTTTTTCCAAAA^ 

acacagattgttacagctgcacagccaacac<yuvtgactgccactcgtccaatga^ 

cagatctccatttttaggtatgagaattgecccaaggataxccctatgtactactgtaa^ 

ccagaactgccagtgggagc<:ccggaatcaggagtgcatt<k:gctgcccgaaaatatctgtggcatt^ 

gtttgaaaattactactgccaaggagaattatgacaatgctaaattgttct^ 

aagaaggtagaattt^ccttaag^gctgcgaataatgcagt^tc 

GAAGATCAATGTGTCCTACTGGTGCTGGGAAGATATGTCCCCATTTACA^ 

GATTCTGTGGAATTTTATCAGAACCCjUSTACTCGGGGACTGAAGGC^^ 

GCAAACCACJlGTGCTAAGCAGTGCCGGACACCATGTGCCTTCAGGACAGa 

CAGCAACATGAAGCAGTGTGTGGACTCOJlTGCCTATGTGGC^ 

CCCCTGAAAATTCTTCAGGCTACTCTACCTGTAGT^ 

GGGAAATGCATAGAGGCTTCCTATAAAGGA<X^GTGAAGATGCCTTCGC^ 

CAGCATGTGTCTAGAGGACAGCAGATACAACTGGTCTTTCATTOICTGTCCAGCTTC 

GCATCTGTGAGAAGTGTGAGAACGTGACOVCAGGCAAGCAC 

T GT CAG C CAT GCAAGTG CAAT G GG CAC G GG T< C TC TG TGCAACACCAACACGGGCAAG TG C TT CTG CACCACCAAGGGOG T-C AAGGGGG ACG A 
GTGCCAGCTATGTGAGGTAGAAAATCGATAGCAAGGAAACCCTCTCAGAGGAX^ 
TTAGTCTATCCCAGGAAGATGATCGCTATTACACAGCTATCAATTTTGTGGCTA^ 
AATGC CTCCAAG AATCTCAACCTCAACATC ACCTGGGCTGCCAGTTTCTC^ 

ATTTCACCTGGCCCATCAAAATTGAGATTGOCTTCTCT^ 

ttcctctctttgctcctggtggctgctgtggtttcgaagatcaaac 
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GCAACAGATGGCCAGCCGTCCCTTTGCCTCTGTAAATGTCGCCTTGGAAACAGATGAGGAGCCTCCTGATCTTATTGGGGCGAGTATAAAGA 
CTGTTCCCAAACCCATTGCACTGGAGCCGTGTTTTGGCAACAAAGCCGCTGTCCTCTCTGTGTTTGTGAGGCTCCCTCGAGGCCTGGGTGGC 
ATCCCTCCTCCTGGGCAGTGAGGTCTTGCTGTGGCCAGCGCCCTGGTGGACATTTCTCAGCAGATGCCGATAGTGTACAAGGAGAAGTCAGG 
AGCCGTGAGAAACCGGAAGCAGCAGCCCCCTGCACAGCCTGGGACCTGCATCTGATGCTGGGGCCAGGGACTCTCCCACGCACGAGCTAGTG 
AGTGGCACACCAGAGCCATCTGCAC^GAAGGGCGTGGCGGGGAAATGGCTGTGCGGTGCGGGACGGAAGACTGGAAACCCT^ 

ACTCACCTGCATGATCACAAGCTTTCTTTGACGGTTTCTCCCATCCGTGTTCCAGCATCTAACCTTTTACTTTTGCATAGGAAATACTTGAT 
TTAATTACAGGTCC^GGGATGAGCTGATGGTTGCTGGAGGAGGCCAGTGTAGAGCCAGTGAGAGAACTAGGAA^^ 

GGAAAACTGTTCTTGGGACTGTGTCAACTGTGCAAAAAACAAAAGATGGAGTGTTTACAAGTAGACATTCGTCATCAGTTGTTCTTGAACAT 

GGTCTTTTAAAAACTAGTCAGATGAATTAACTTGTTTTCATCTGAAGCCTGCTATCTTTTTTAAAAGATGTGCTATTTATTCTTGCACGATT 

TAGGCAATTATCTCTGTTCGAGGGAGTACCTTTTTTTCTAGTTGAGAATTAATAATGGTCCATCTCTTTTGATCATATCAAGCTAGGAIAGA 

AGGGGGGCTATTTTAAATGTCAAGGTCAGCAGTGTTACTTTGAATGTAAACTGGTATAATAGGTAGTTTTCTATAGTAACTTGATTAATTTA 

GTCTTAATCCATTTGAAACTCTCTCTTCCTTTCTCTCTGCCTGTCCCTCTCCTTCTCCATCTCACCCTCCCTCTGTCACACATACACACACA 

AACACATACACACAACACTAAGTGCCTAGACTTTAAATAGATCTAGCAATTGGAAAGTTAGTAAGCCTAAGTTTTTACATAATTGCATTCCT 
*"""TGTAAAATTTAAATAGCTACCAT^^ 

AACATTTGAICTAGCAGCAGGGATGAGAGGAAAGCAGAAAIGAATGAACTGTGAAAGCTCCTGTTTTTATIATCAAAAAGGACACTGTCAAG 
AAGGCGCCCCCTGCCCCCACCCCCGTGTCACCCTAGGCCTGATAAGeGATCAGAGGAAAGGACTCATTCATGTCACGCTTCCTTGAGCAGAA 
^^^*^^*^^*^*^^*^'^^^*^^^^^ A ^^^*^*^^^^^^^^^^^^^^^^*SCCTCCTCTGAACTTGTGGTTCATTCTCAGGCTGGG 
GTGGACTCAGATGCCAGGAAAGGGACAGCCTCCCATTGTCAGGCAGAAGCTGCCCAAAGCCTGGAGAAGGACTTGTTTGCCCTCTTTCCCCC 
AGGAGGGGCTCGACCCACCCACCGTCCCTCTCAGACCA.GGTGGTGGCTGTGAGGAGGGCAGGAAATGCTGACAAGG^^ 

AAAAAAATGGACGAGGAGGGAAAACTCTGCCAAATGGAAAATGACCAAATTTAAGAGGGTGGGACAGTCCCCTGCTCCTCT-CCCAGAGGGCA 

CTGCTTGGAAATTGTGTTITCCCCATTTATGGTGCICTGTATTCTGGCATTATGCAGCAGCCTCCCAGAAGCTCTCTTCTGCTTCAAAACCT 

GGGATCTCTGGCATTACCCTATTGGGATGGACCGCTGGA«GCAATGCTCGAGTTTGTGAATTTGGAGAGATACTCAAAAGAGCTA^ 

CAGCATTTTACCTTTAAATGCAGTGCCTAGAGAGAGAGTATTGTCTCTTCCCCAACACTAACCCCACTCCCATGAAGAATTGCCTGGAAAGA 
TGTTTTCAA^AATTTGAACCATAAAACACTATCTGATGCACAGAACACCT^ 

TGGACTGGCCCCTTAATTCCCACAGGCCCCCCCAGCAAGGCCAAAGGGAGGCCCCTGG 



TACIGTTAAAGACCAGACGTTCTAGAAAAGACCCCTCCTCTCATGAGCTCCCCCATCCCTGCTACAGAACACAGCACCCATGGCGCCTGC 
TGTTCAAAGGACCAGTTTTCCTAGGCCAAAGAAGTCTCTTCCCCATGTTAGTCCTATGCCTTGAAATATCATGCACCATGACCCACAGCC* 



CTGGTTATGTCTTATTTTTTTCCTAAAAGATAATGTTTATTTTTAAAAAGGAAGGAAGAAGCAAGTGAAGTT^CATTCTGCTCCAGCGGTGG 
GGAAGCCGCTGAATCCACCTGCTTCTCCTTTGCAACCGACAGCAAACAGCTTTCTCCGG^^^ 

AGAGGCGCTGGGCTCGGAGCCTGTTTCCAAGAA^AATTGGTTGTCATCTGGCAGTG^ 

ATAGTCAAGACAI^CTGACCTTGCACTTGTACATAACTATACAGTAGTGTCCA^ 

AAICTTCAIGIATTTTTATTAAATATAACAATGTCTGAGTTTCACCIAAGATGTTTTTGTGCCAIAIGCTGGAtATCCAGGTTCTCGCCAGG 
GGACTC*TCTGTCAGAGTGCACA TC ^ 

^^^^^^^^^^^^^^^**^^*^*^*^*^^^^^^^^^^^*^^^^^^^^^^^^^^^^^^*SIGAGGGATGACCAGTCTTGTCCTGAGAAAT 

TTAATGACCATTTCCTGGAATC^GTTTATTATACTGAAAACTGGGGGTGGGAGTAG^ 
GTGGAGAATTTGACATACCCTGGACTCCTGTGTGCCTCCTGCCATC^^^ 

aaggcaaccccagatcccctgagctaacccggaggaaaggcagtcctggacagaagactgtcagcagaaggaXagtactggactacgcgtgg 

TCCTGCTCCTTTGATTTAATGGGGTGTACTGGGGCCAGGGGCTGATTCACTTCCTT^ 

ttccatttcatttattcatccatccattcaacaagtatttgctaaacactaacttaagctaatgctagggtagtgactgagatgtaaaaata 

GATTTTAGAATTAAAACAAAATCC^GTCCTCACACC^TGTCATCCCAGGAGATCT^^^^ 
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TGAGGACACAGCCAGGACGGCAGAGGCCTCCTGGCCT-CAGGGCATGCCCTGCCTACCTTCTGAAATGTTTACCCCATTGACCAAACTTGGCT 
CCAGCCATTGCGGTGGTTTCTAGATAGCCAGGCCCACCAAGAGATATTGCCCCTT-GATGAGAGTCAAACACCCTGCCTACAAGGAGATGTTT 
TGAAATGGAGAGGAAAATTGGCACCTCATCTTTTAAAGGCAGTAATGGAATTGATTTTCAGT 

ACTAGTGAAGCCTGTTTCGTTGAAGTAATTCTGGCTCTGGAAATGTTTTTGTTTTATAGTTATTTACGATTTGGTTTGTTTGGATTCAAGCT 

TAGTTTGTTAATATGTATAATTTAGCATCTATTACACTCATGTAAATATGGAGTAAGTATTGTAAACTATTTCATTGCGGGGATTGTGGGTG 

TTATACATACATTTAGGACTGCAATTTTTTGGTATTTTTTGTATTGTAAAATAACAGCTAATTTAAGCAGGAAGAAGAGAACTAAGGGAGGT 

CTGTGCATTTTAAACACAAATGTGAAGAACTTGTATATAAACAA/U\^ 

GTAAuAAAAAAAAAAAGAAAAAAAAAAAAAAAAA 
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mvavaaaaatearlrrrtaa: : ^.lagrsggphrpctatgawrpgprap.i-lprvlsralppppllpllfsllllplpreaeaaavaaavsgs 

AAAEAKECDRPCVNGGRCKr 7 SOCVCPAGWVGEOCQHCGGRFRLTGSSGrVTDGPGNyKYKTKCTWLIEGOPNRIMRLRFNHFATECSWDH 
LrVYDGDSIYAPLVAAFSG:.: .•PERDGNETVPEWATSGYALLHFFS.D^.yNLTGFNITySFDMCPNNCSGRGECKISNSSDTVECECSENW 
KGEACDI PHCTDNCGFPHRC- Z - N ^SDVRGCSCFSDWOGPGCSVPVPANOSFVTREEYSNLKLPRASHKAVVNGNIMWyVGGYMFNHSDYNMV 
LAYDLASREWLPLNRSVNN\-r/RYGHSLALYKDKIYMYGGKIDSTGNVT»lELRVFHIHNESWVLLTPKAKEQYAVVGHSAHIVTLKNGRVVM 
LVIFGHCPLYGYISNVOEYniDKNTWSILHTOGALVOGGYGHSSVYDHRTRALYVHGGYKAFSANKYRlADDLYRYD 
RYLHTAVIVSGTMIA/FGGNTHNDTSMSHGAKCFSSDFMAYM^^ 

EOCDAHRSEAACLAAGPGIP- ^- ^NTGSSOCISWALATDEQEEKLKSECFSkRTLDHDRCDOHTDCYSCTANTNDCHVJCNDHCVPRNHSCSEG 
OISIFRYENCPKDNPMYYC f ;-:<TSCRSCALDONCOWEPRNOECIALPEtaCGIGWHLVGNSCLKITTAKENYDNAkLFCRNHNALLASLTTO 
KKVE FVLKQLRIMQSSQSM£ ^-TLTPWVGLRKINVSYWCWEDMSPFTNSI.LOWMPSEPSDAGFCGILSEPSTRGLIO^TCINPLNGSVCERP 

anhsakocrtpcalrtacgl;tsgssecmwcsnmkocvdsnayvasfpfgc^mewytmstcppencsgyctcshcleopgggwctdpsktgk 

GKCIEGSYKGPVKMPSOAPT:-NFYPOPLLNSSMCLEDSRYNWSFIHCPACOCNGHSKCINOSICEKCENLTTGKHCEtCISGFYGDPTNGGK 
CQPCKCNGHASLCNTNTGKCFCTTKGVKGDEC^^^ 

NASKNFNLNITWAASFSAGTOAGEEMPVVSKTNIKEYKDSFSNEKFDFRNHPNITF 

FLSLLLVAAVWKIKOSCWASRRREOLLREMCK>MASRPFASVNVALETDEEPPDLIGGSIKTVPKPIAI.EPCre^ 

IPPPGOSGLAVASALVDISOOMPIVYKEKSGAVRNIU((X)PPAOPGTCICWGOGLSHAJ^SEWHTRAICREGRGGE^VRCGTEDKKPSKHLT 

HLHDHKLSLTVSPIRVPASr^LLLHRKYLILOVQGADGCWRRPVSOENEHSGSLWKTVLGTVSTVQKTKDGVFTSRHSSSVVLEHGLLKTSQ 
"N'LVFISLLSFLKDVLFILAP.FRQLSLFQGVPFFLV^^^ 

vrLLLHLTLPLSHIHTOTHTHNTKCLDFKIQLESAVFTLHSYILVKFKLPLAICFFSKIFAARKEFSHPRNISSSRDERKAEMNELKLLFLL 

SKRTLSRRRPLPPPPCHPRPDKRSEERTHSCHASLSRKEHEHLGPLDORASVCPAASSELWHSOAGVDSDARKGTASHCQAEAAOSLEKDL 

FALFPPGGARPTHPPSQTKWAVRI^NADKDEKHMEKNGRGGKTLPNGKPNLRGWDSPLI^SORALLGNCVFPIYGALYSGIMQOPPR^ 

LLONLGSLALPYWDGPLDStJARVCEFGEILKRAKTAAFYI^OCLEREYCLFPNTNPTPMKNCLERCFQGITIKHyLMHRTPLLDSPLIKLLF 

HITVKDOTFKRPLLSAPPSLLONTAPMAPAVDWPLNSHRPPC^OREAPGYCPPTRKILFVCSKDQFSAKEVSSPCSYAiKYHAPPTAIWLC 

LIFFLKDNVYFKGRKKOVKFHSAPAVGKPLNPPASPLQPTANSFLRPOGRKREWOnmGAGl^ACFQEGIGCHlAVI.RWrREPVYKLKSRQH 

PCTCTLYSSVQNVOTFGVYIKOKKSSCIFIKYNNVVSPKMFLCHMLDIQV1ARPRYMNNKPKKRIPIVCVQMH1APIRYFLKODSSVRVHMK 

NOAGNRNDSAGGDSGSRGVPAAALGPASTSPFI^SHCSGGMTSLVLRNVSI.SLVQRLTCSNOTFQKGVUiFLFCLNFPFPGISLLYKLGVG 

VGSFVDKFPFPRGEFDI PWTPVCLLPSLHTAWGEACASPCGEKATPDPLSPGGKAVLDRRLSAEGKYWTTRGVLPFKTGDTWEIKRAGH 

EEAFYLPVOILLLFNGVYWGOGLIHFLGRWWCFHEHLSFHFIYSSIHSTSICTLTANARVVTEMKILELKQNPSPHTPVIPGDLSLWWFLEL 

AILRTOPGRORPPGLRACPAYLLKCLPHPNLAPAIAWSRPGPPRDIAPESNTLPTRRCFEMERKIGTSSFKGSNGIDFQLNLCTKHSKHSL 

FRTNSGSGNVFVLLFTISFVWIOAFVNMYNIASITLMIWSKYCKLFHCGDGGCYTYIDCNFIVFFVLNNSFKQEQENG 
KQKI LYKLPSEI KVDLVKKKKEKKKKK 
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ATGGTGGCGGTGCCCGCAGCGGCGGCAACTGAGGCAAGGCTGAGCAGGAGGACGGC^ 

GCGACAGGGGCCTGGAGGCCGGGACCGCGCGCCCGGCTGTGTCTCCGGGGGGTGCTGTCGCGGGCGCTGCCCCCGGCGCCGCTGCTGCCGCTGCTCTTTTGGCTGCTG 
CTGCTGCCGCTGCCCCGGGAGGCCGAGGCCGCTGCGGTGGCGGCGGCGGTGTCCGt-CTCGGCCGCAGCGGAGGCCAAGGAATGTGACCGGGGGTGTGTCAACGGCGGT 
CGCTGCAACCCTGGCACCGGCCAGTGCGTCTGCCCCGCCGGCTGGGTGGGCGAGC.-ATGCCAGCACTGCGGGGGCCGCTTCAGACTAACTGGATCTTCTGGGTTTGTG 
ACAGATGGACCTGGAAATTATAAATACAAAACGAAGTGCACGTGGCTCATTGAAGGACAGCCAA^ 

AGTTGGGACCATTTATATGTTTATGATGGGGACTCAATTTATGCAGCGCTAGTTGCTCCATTTAGTGGCCTCATTGTTGCTGAGAGAGATGGCAATGAGACTGTCCCT 
GAGGTTGTTGCCACATCAGGTTATGCCTTGCTGCATTTTTTTAGTGATGCTGCTTATAATTTGACTGGATTTAATATTACTTACAGTTTTGATATGTGTCC7U\ATAAC 
TGCTCAGGCCGAGGAGAGTGTAAGATCAGTAATAGCAGCGATACTGTTGAATGT^ 
AACTGTGGTTTTCCTCATCGAGGCATCTGCAATTCAAGTGATGTCAGAGGATGCTCCTGCT^ 

CAGTCATTTTGGACTCGAGAGGAATATTCTAACTTAAAGCTCCCCAGAGCATCTCATAAAGCTGTCGTCAATGGAAACATTATGTGGGTTGTTGGAGGATATATGTTC 

AACCACTCAGATTATAACATGGTTCTAGCGTATGACCTTGCTTCTAGGGAGTGGCTTCCACTAAACCGTTCTGTGAACAATCTGGTTGTTAGATATGGTC 

GCATTATACAAGGATAAAATTTACATGTATGGAGGAAAAATTGATTCAACTGGGA^ 

TTGACCCCTAAGGCAAAGGAGCAGTATGCAGTGGTTGGGCACTCTGCACACATTGTTAC^^ 

CTCTATGGATATATAAGCV^TGTGCAGGAATATGATTTGGATAAGAACACATGGAG 

GTTTACGACCATAGGACCAGGGCCCTATACGTTGATGGTGGCTACAAGGCTTTCAGTGC^ 

CAGATGTGGACCATTCTTAAGGACAGCCGATTTTTCCGTTACTTGCACACAGCT^ 

TCTATGAGCCATGGCGCCAAATGCTTCTCTTCAGATTTCA^ 

AGATTTGGCCATTCAGCAGTCfTACACAACAGCACCATGT^ 

GCGCATCGGAGTGAAGCCGCTTGTTTAGCJVGCAGGACCTGGTATTCGGTGTGTGTGG^ 

GAAGAAAAGTTAAAATCAG^TGTTTTTCCAAAAGAACTCTTGACCATGACAGATGTGACCAGCACACAG 

TGGTGCAATGACCATTGTGTCCCCAGGAACCACAGCTGCTCAGAAGGCCA 

AAG AAG AC CAGCT G C AGGAGCTGT G CCCTGG AC CAGAACTG CCAGTGGG AGCCGC<»G AATCAGGAGTGCATTG CCCTGGGCGAAAAT ATCTGTGGCATTGG CTGGC AT 

TTGGTTGGAAACTCATGTTTGAAAATTACTACTGCCAAGGAGAATTATGACAATGCTAAATTGTTCTGTAGGAACCACAATGCCCT 

AAGAAGGTAGAATTTGTCCTTAAGCAGCTGCGJ^TAATGCAGTCATCT^ 

TACTGGTGCTGGGAAGATATGTCCCCATTTACAAATAGTTTACTACAGTGGAT^^ 

CGGGGACTGAAGGCTGCJU^CCTGCATCAACCCACTCAATGGTAGTGTCTGTGAAAGGCCTGCAA 

GCATGTGGAGATTGCACCAGCGGCAGCTCTGAGTGCATGTGGTGCAGCA^ 

ATGGAATGGTATACGATGAGGACCTGCCCCCCTGAAAATTGTT 

AATACTGGCAAAGGGAAATGCATAGAGGGTTCCTATAAAGGACt^TGAAGATG^ 

ATGTGTCTAGAGGACAGCAGATACAACTGGTCTTTCATTCACTGTCCAGCT^ 

AACCTGACCACAGGCAAGCACTGCGAGACCTGCATATCTGGCTTCTACGGTGATCCCxACCAATGGAGGGAA 

TGCAACACCAACACGGGCAAGTGCTTCTGCACCACCAAGGGCGTC 

ACATGTTATTATACTCTTCTTATTGACTATCAGTTCA^^ 

AACAGGGATTTGGACATCTTCATCAATGCCTCCAAGAATTTCAACC^ 

GTTTCAAAAACCJ^CATTAAGGAGTACAAAGATAGTTTCTCTA 

tggccc^tcaaaattcaggtgcaaacrtgaacaatgaggacgcatggacacaggaagggg 
agcattaggggatatacctaatgttaaatgacgagttaatg<x;tgcagcacaccaacatggcatatg 
t aaaact t aaagt ataatt aaaaaaaaaaaaaxy^aaaaaaaaaaaaaaaaaa 
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^AVAAAAATEARLRRRTAATAALAGRSGGPHRPCTATGAWRPGPRARLCLPRVL^RALP 

RCNPGTGOCVCPAGWVGEOCOHCGGRFRLTGSSGFVTDGPGNY KYKTKCTWLI EGQPNRIMRLRFNH FATECSWDHLYVYDGDS 1 YAPLVAAFSGLI V PERDGNETVP 
EWATSGYALLHFFSDAAYNLTGFNITYSFDMCPNNCSGRGECKISNSSDTVECECSENWKGEACDI PHCTDNCGFPHRGICNSSDVRGCSCFSpWOGPGCSVpVPAN 

OS fwtreeysnlklprashkawngnimwwggymfnhsdynmvi^ 

LTPKAKEOYAWGHSAHIVTLKNGRVVMLVIFGHCPLYGYISNVOEYDLDKNTWSILHTQGALVOGGYGKSSyYDHRT 

Omwtilkdsrffrylhtavivsgtmlvfggnthndtsmshgakcfssdfmaydiacdrwsvlprpdlhhdwrfghsavlhnstmyvfggfn 

AHRSEAACLAAGPG1RCWNTGSSQCISWA1ATDEQEEKLK 

kktscrsgaldoncoweprnoecialpenicgigwhlvgnsclkittakenydnaklfcrnhnaluvsltto 

ywcwedmspftnslwmpsepsdagfcgilsepstr^ 

mewytmstcppencsgyctcshclegpgcgwc™^ 

nlttgkhcetcisgfygdptnggkcqpckcnghaslcntntgkcfcttkgvkgdecolcevenryognplrgtcyytllidyoftfslsoeddryytai^ 

NRDLDMFIKASKNFNLNITWAASFSAGTOAGEEMPVVSkTNIKEYKDSFSNEKFDFRNHPNITFFVYVSNFTWPIK 

irgytcmtswvohtnmayvyicnkpaccahvpnlkynkkkkkkkkkkkkkkkkkk 



F i . 1 q B 



BNSDOCID: <WO 0005373A2 t > 



WO 00/05373 PCT/US99/16484 

88 / -89 

ATGGTGGCGGTGGCCGCAGCGGCGGCAACTGAGGCAAGGCTGAGGAGGAGGACGGCGGCGAC^ 

GCGACAGGGGCCTGGAGGCCGGGACCGCGCGCCCGGCTGTGTCTCCCGCGGGTGCTGTCGCGGGXTGCTGGCGCCGCCGGCGCT 

CTGCTGCCGCTGCCCCGGGAGGCCGAGGCCGCTGCGGTGGCGGCGGCGGTGTCCGGCTCGGCCGCAGCCGAGGCCAAGGA 

CGCTGCJ^CCCTGGCACCGGCCAGTGCGTCTGCCCCGCCGGCTGGGTGGGCGAGCAATGCCAGCACTGCG 

ACAGATGGACCTGGAAATTATAAATACAAAACGAAGTGCACGTGGCTCATT 

AGTTGGGACCATTTATATGTTTATGATGGGGACTCAATTTATGCACCGCTAG 

GAGGTTGTTGCCACATCAGGTTATGCCTTGCTGCATTTTTTTAGTGATGCTGCTT^^ 

TGCTCW^CCGAGGAGAGTGTAAGATCAGTAATAGCA^ 

AACTGTGGTTTTCCTCATCGAGGCATCTG 

CAGTCATTTTGGACTCGAGAGGAATATTCTAAOT 

AACCACTCAGATTATAACATGGTTCTAGCGTATGACCTTGCTTCTAGGGAGTGGOT 

GCA^ATACAAGGATAAAATTTACATGTATGGAGGAAAA^^ 

TTGACCCCTAAGGCAAAGGAGCJ^^ 

CTCTATGGATATATAAGCAATGTGCAGGAATATGATTTGGATAACAA^ 
GTTTACGACCATAGCACCAGGGCCra 

CAGATGTGGACOlTTCTTAAGGACAGCCGATTTTTCCGTTA 

TCTATGAGCCATGGCGCOIAATGCTTC^^ 

AGATTTTOCCATTCAGCAGTCTTACACAACAGCACW 

GCGC^TCGGACTGAAGCCGCTTGTTTAGCAGCAGGACCTGGTA^ 

GAAGAAAAGTTAAAATCAGAATGTTTTTCCAAAAGAACT 

TGGTGCJ^TGACCATTGTGTCCCCAGGAACCACAGCTGCTCAGAAG^ 

AAGAAGACCAGCTGCAGGAGCTGTGCCCTGGACCAGAACTGCCAGTGG^ 

TGTGTGGGTCCATTACTTCAGCCTGCTT^^ 

ATGGGCTTTTTTGTTTTTAACTAAAATACAGTTCTTAAGTATTTGTTCCTACTGTCCT 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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RCNPGTGQCVCPAGWVGEQCQHCGGRFRLTGSSGFVTDGPGNYKYKTKCTWLI EGQPNR1MRLRFNHFATECSWDH LYVYDGDS j YAPLVAAFSGLI VPERDGNETVP 
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LTPKAKEOYAWGHSAHIVTLKNGRVVMLVIFGHCPLYGYISNVQEYDLDKNTWSILHTQGALVQGGYGHSSVYDHRTI^LYVHGGYKAFSANKYRIADDLYRYDVDT 
^TILKDSRFFRYLHTAVIVSGTMLVFGGNTHNpTSMSHC^ 

AHRSEAACI^GPGI RCVWNTGSSQC I SWAWTDEQEEKLKSEC^^^ 

KKTSORSCAIjbQNCQWEPRNQECIALPGRPCRVl LVCVGPLLQPASPNTVQPKLNLAEGKSFCPFI PHTSIMGFFVFNNTVLKYLFLLSFEIKNI LCCSVKKKKKKKK 
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diagnosis and treatment of mammalian body weight disorders, including obesity, cachexia, and anorexia, and for the identification 
of subjects susceptible to such disorders. Further, the invention relates to methods of using the mahogany gene and-gene products 
of the invention for the identification of-compounds which modulate the expression of the mahogany -gene and/or the activity of the 
mahogany gene product. Such compounds can be useful as therapeutic agents in the treatment of mammalian body weight disorders, 
including obesity, cachexia, and anorexia. 
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METHODS AND COMPOSITIONS FOR THE DIAGNOSIS AND 
TREATMENT OF BODY WEIGHT DISORDERS . INCLUDING OBESITY 

Priority of provisional application no. 60/093,630 filed 
5 on July 21, 1998 and of provisional application no. 
60/104,978 filed on October 20, 1998, each of which is 
incorporated herein by reference in its entirety, is claimed 
under 35 U.S. C. § 119(e)(1). 

10 INTRODUCTION 

The present invention relates to mammalian mahogany 
genes, including the human mahogany gene, which are novel 
genes involved in the control of mammalian body weight. The 
invention encompasses nucleotide sequences of the mahogany 

5 gene, host cell expression systems of the mahogany gene, and 
hosts which have been transformed by these expression 
systems, including transgenic animals . The invention also 
encompasses novel mahogany gene products , including mahogany 
proteins, polypeptides and peptides containing amino acid 
sequences mahogany proteins, fusion proteins of mahogany 

0 proteins polypeptides and peptides, and antibodies directed 
against such mahogany gene products. 

The present invention also relates to methods and 
compositions for the diagnosis and treatment of mammalian 
body weight disorders, including obesity, cachexia, and 

5 anorexia, and for the identification of subjects susceptible 
to such disorders. Further, the invention relates to methods 
of using the mahogany gene and gene products of the invention 
for the identification of compounds which modulate the 
expression of the mahogany gene and/or the activity of the 
mahogany gene, product . Such compounds can be useful as 
therapeutic agents in the treatment of mammalian body weight 
disorders, including obesity, cachexia, and anorexia. 
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■a 7s rvdVl OTTND ™ ™* THVENTION 

2 ' Obesity represents the most prevalent of body weight 
^ and it is the most important nutritional d,sorder 

tie western world, with estimates of its prevalence 
L ging 30 % to S0 % within the middled population. 
5 Tther Lay weight disorders, such.s ^ of 

which together affect approximately 
bulimia nervosa, wmcn i-ua „ serious 

the female population of the western world, also pose ser 
h ItHhreats Further, such disorders as anorexia and 

«. — — r teat r;^ s other 

10 diseases such as cancer, cystic fibres^ an * 
Obesity, defined as an ««ss of body fat 

also contributes to other diseases. For 
lean body mass, also cont incre ased incidence 

, his disorder is responsible for increa 
example, tms u . ( hypertension, 

ot diseases such as coronary artery disease, hype 
i* v diabetes hvperlipioemia. and some cancers (See. 

" Kisnina PM et al . . Xrt*. Me»b. 43, SS4-S58, Orundy, 
Nishina .FM ^ 64i 73i) obesity 

s .„. s, Barnett, J. P.. "9, ^ 
is not merely a behavioral problem, i.e., the 
voluntary hyperphagia. -ther. the differentially 
20 composition observed between obese and normal sub 3 ects 
lf . from differences in both metabolism and 
Ituro I. c metabolic interactions. These differences se-n to 
:l t some extent, due to differences in expression 
and/or level of 3 ene 

" 3l " Tne^piaemioioLToL^y Wrongly shows that the 

dis orL exhibits inherits characteristics (Stunxar^ l£0. 

" i =p :r , - 

estimates of heritability of 80-90% (Simopoulos. A.P. * 
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Childs. B., eds., 1989, in "Genetic Variation and Nutrition 
in Obesity", World Review of Nutrition and Diabetes 63, S. 
Karger, Basel, Switzerland; Borjeson, M. , 1976, Acta. 
Paediatr. Scand. 65: 279-287). 
5 In other studies, non-obese persons who deliberately 

attempted to gain weight by systematically over-eating were 
found to be more resistant to such weight gain and able to 
maintain an elevated weight only by very high caloric intake . 
In contrast, spontaneously obese individuals are able to 
10 maintain their status with normal or only moderately elevated 
caloric intake. In addition, it is a commonplace experience 
in animal husbandry that different strains of swine, cattle, 
etc., have different predispositions to obesity. Studies of 
the genetics of human obesity, and of animal models of 
obesity demonstrate that obesity results from complex 
15 defective regulation of both food intake, food induced energy 
expenditure, and of the balance between lipid and lean body, 
anabolism. 

There are a number of genetic diseases in man and other 
species which feature obesity among their more prominent 

20 symptoms, along with, frequently, dysmorphic features and 

mental retardation. For example, Prader-Willi syndrome (PWS; 
reviewed in Knoll, J. H. et al . , 1993, Am. J. Med. Genet. 46: 
2-6) affects approximately 1 in 20,000 live births, and 
involves poor neonatal muscle tone, facial and genital 

25 deformities, and generally obesity. 

In addition to PWS, many other pleiotropic syndromes 
have been characterized which include obesity as a symptom. 
These syndromes are genetically straightforward, and appear 
to involve autosomal recessive alleles. Such diseases 
include, among others, Ahlstroem, Carpenter, Bardet-Biedl , 

30 Cohen, and Morgagni- Stewart -Monel Syndromes. 

A number of models exists for the study of obesity (see, 
e.g., Bray, <5. A., 1992, Prog. Brain Res. 93: 333-341; and 
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Bray, G.A. , 1989, Amer . J. Clin. Nutr. 5: 891-902). for 
example, animals having mutations which Lead to syndromes 
that include obesity symptoms have also been identified. 
Attempts have been made to utilize such animals .as models for 
5 the study of obesity, and the best studied animal models to 
date for genetic obesity are mice. For reviews, see, e.g., 
Friedman, J.M. et al . , 1991, Mamm. Gen. 1: 130-144; Friedman, 
J.M. and Liebel, R.L. , 1992, Cell 69: 217-22*0. 

Studies utilizing mice have confirmed that obesity is a 
very complex trait with a high degree of heritability. 
10 Mutations at a number of loci have been identified which lead 
to obese phenotypes. These include the autosomal recessive 
mutations obese (oJb) , diabetes (d£>) , fat (fat) , and -tubby 
( tuJb) . 

The dominant Yellow mutation {Ay) at: the agouti locus 
!5 causes a pleiotropic syndrome which causes moderate adult 
onset obesity, a yellow coat color, and a high incidence of 
tumor formation (Herberg, L. and Coleman, D.L., 1977, 
Metabolism 26:59), and an abnormal anatomic distribution of 
body fat (Coleman, D.L., 1978, Diabetologia 14:141-148). The 
20 mutation causes the widespread expression of a protein which 
is normally seen only in neonatal skin /{Michaud, E. J. et 
al., 1994, Genes Devel. 8:1463-1472). The agouti protein has 
been reported to be a competitive antagonist of a-MSH binding 
to the melanocortin receptors MCI - R and MC4-R in vitro <liu -et 
al., 1996, Nature 371:799-802), and the authors speculated 
that de-regulated ubiquitous expression of agouti may lead to 
obesity by antagonism of melanocortin receptors expressed 
outside the hair follicles . 

Mahogany (mg) and mahoganoid (md) are mutations that 
suppress the phenotypic effects of agouti protein in viyo 
30 (Lane and Green, I960, J. Hered. 51: 228-230). The mahogany 
and mahoganoid mutation have been mapped to mouse chromosomes 
2 and 16, respectively ^<3reen, 1989, "Catalog o£ mutant genes 
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and polymorphic loci", pp. 12-403 in Genetic Variants and 
Strains of t he Laboratory Mouse , Lyon, M . F. and Searle, 
A.G., eds., Oxford University Press, Oxford). Mutations of 
both mg and md have been shown to suppress the effects of 
agouti on obesity as well as on coat color (Miller et al . , 
5 1997, Genetics 146: 1407-1415) . 

In summary, therefore, obesity , which poses a major, 
worldwide health problem, represents a complex, highly 
heritable trait. Given the severity, prevalence, and 
potential heterogeneity of such disorders, there exists a 
10 great need for the identification of those genes that 
participate in the control of body weight. 

3 • SUMMARY OF THE INVENTION 

The present invention relates to the identification of 

l5 novel nucleic acid molecules and proteins encoded by such 

nucleic acid molecules that are involved in the control of 

mammalian body weight, and which, further, are associated 

with mammalian body weight disorders such as obesity, 

cachexia, and anorexia. The nucleic acid molecules of the 

present invention represent the genes corresponding to the 
20 -i • i_ 

mammalian mahogany gene, including the human mahogany gene. 

In particular, the compositions of the present invention 

include nucleic acid molecules which comprise/the following 

sequences: (a) nucleotide sequences of the mahogany gene, 

including, e.g. , murine mahogany sequences as shown in FIGS. 

25 2A, 3B-D, 6A-B, 8A-C, and 9A, as well as allelic variants and 
homologs thereof, and human mahogany sequences, as shown, 
e -q w in FIGS. 10A, 18A-C, 19A-C and 20A-B, as well as. 
allelic variants and homologs thereof ; (b) nucleotide 
sequences that enoode the mahogany gene product amino acid 

3Q sequences, as shown, e.g. , in in FIGS. 2B, 8D, 9B, 10B, 17A-D, 
18B-D, 1 9D and 20G; (c) nucleotide sequences that encode 
portions of the mahogany gene product corresponding to its 
functional domains 
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and individual exons; (d) nucleotide sequences -comprising the 
novel mahogany gene sequences disclosed herein that encode 
mutants of the mahogany gene product in which all or a part 
of one or more of the domains is deleted or altered, as 
5 shown, e.g. , in FIG. 6; (e) nucleotide sequences that encode 
fusion proteins comprising the mahogany gene product, or -one 
or more of its domains fused to a heterologous polypeptide ; 
(f) nucleotide sequences within the mahogany -gene , as well as 
chromosome sequences flanking the mahogany gene., see, e.g., 
FIG. 3, which can be utilized as part of the methods of the 

10 present invention for the diagnosis of mammalian body weight 
disorders, including obesity, cachexia, and anorexia, which 
are mediated by the mahogany gene, as well as for the 
identification of subjects susceptible to such disorders; (g) 
nucleic acid sequences that hybridize to the above described 

15 sequences under stringent or moderately stringent conditions, 
particularly human mg homologs. The nucleic acid molecules 
of the invention include, but are not limited to, cDNA and 
genomic DNA sequences of the mahogany -gene. 

The present invention also encompasses expression 

20 products of the nucleic acid molecules listed above; i.e., 
proteins and/or polypeptides that are encoded -by the above 
mahogany nucleic acid molecules. 

Agonists and antagonists of the mahogany gene and/or 
gene product are also included in the present invention. 
Such agonists and antagonists will include, for example, 

25 

small molecules, large molecules, and antibodies directed 
against the mahogany gene product. Agonists and antagonists 
of the invention also include nucleotide sequences, such as 
antisense and ribozyme molecules, and gene or regulatory 
sequence replacement constructs, that can £>e used to inhibit 
30 or enhance expression of the mahogany gene. 

The present invention further encompasses cloning 
vectors/ including expression vectors, that contain tlie 
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nucleic acid molecules of the invention and can be used to 
express those nucleic acid molecules in host organisms- The 
present invention also relates to host cells engineered to 
contain and/or express the nucleic acid molecules of the 
5 invention. Further, host organisms which have been 
transformed with these nucleic acid molecules are also 
encompassed in the present invention . Host organisms of the 
invention include organisms transformed with the cloning 
vectors described above, e.g. , transgenic animals , 
particularly non-human transgenic animals , and particularly 
transgenic non-human mammals. 

The transgenic animals of the invention include animals 
that express a mutant variant or polymorphism of a mahogany 
gene, particularly a mutant variant or polymorphism of a 
mahogany gene that is associated with a weight disorder such 

!5 as obesity, cachexia , or anorexia ..* The transgenic animals of 
the invention further include those that express a mahogany 
transgene at higher or lower levels than normal. The 
transgenic animals of the invention further include those 
which express the mahogany gene in all their cells, . "mosaic" 

20 animals which express the mahogany gene in only some of their 
cells, and those in which the mahogany gene is selectively 
introduced into and expressed in a specific cell type(s) . 
The transgenic animals of the invention also include "knock- 
out" animals. Knock-out animals comprise animals which have 
been engineered to no longer express the mahogany gene . 

25 

The present invention also relates to methods and 
compositions for the diagnosis of mammalian body weight 
disorders/ including obesity, cachexia, and anorexia, as well 
as for the identification of subjects susceptible to such 
disorders. Such methods comprise, for example, measuring 
30 expression of the mahogany gene in a patient sample, or 

detecting a mutation in the mahogany gene in the genome of a 
mammal, including a human, suspected of exhibiting such a 

- 7 - 
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weight disorder. The nucleic acid molecules of the invention 
can also be used as diagnostic hybridization probes, or as 
primers for diagnostic PCR analysis to identify of mahogany 
gene mutations, allelic variations, or regulatory defects, 
5 such as defects in the expression of the mahogany gene. Such 
diagnostic PCR analyses can be used to diagnose individuals 
with a body weight disorder associated with a particular 
mahogany gene mutation, allelic variation, or regulatory 
defect. Such diagnostic PCR analyses can also be used to 
identify individuals susceptible to such body weight 
10 disorders and hyperphagia. 

Methods and compositions, including pharmaceutical 
compositions, for the treatment of body weight disorders such 
as obesity, cachexia, and anorexia are also included in the 
invention. Such methods and compositions are -capable of 
15 modulating the level of mahogany gene expression and/or -the 
level of activity of the mahogany gene product. Such methods 
include, for example, modulating the expression of the 
mahogany gene and/or the activity of the mahogany gene 
product for the treatment of a body weight disorder which is 
20 mediated by some other gene, for example by the agouti gene. 
The invention still further relates to methods for 
identifying compounds which modulate the expression of the 
mammalian mahogany gene and/or the synthesis or activity of 
mammalian mahogany gene products. Such compounds include 
therapeutic compounds which can be used as pharmaceutical 
2 * compositions to reduce or eliminate the symptoms of mammalian 
body weight disorders such as obesity, cachexia, and 
anorexia. Cellular and non-cellular assays are -descried 
that can be used to identify compounds that interact with the 
mahogany gene and/or gene product, e.g., modulate the 
30 activity of the mahogany gene and/or bind to tiie mahogany 
gene product. Such cell -based assays of the invention 
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utilize cells, \ cell lines, or engineered cells or cell lines 
that express the mahogany gene product. 

In one embodiment, such methods comprise contacting a 
compound to a cell that expresses a mahogany gene, measuring 
5 the level of mahogany gene expression, gene product 

expression, or gene product activity, and comparing this 
level to the level of mahogany gene expression, gene product 
expression, or gene product activity produced by the cell in 
the absence of the compound, such that if the level obtainjed 
in the presence of the compound differs from that obtained in 

10 

its absence, a compound that modulates the expression of the 
mammalian mahogany gene and/or the synthesis or activity of 
mammalian mahogany gene products has been identified. 

In an alternative embodiment , such methods comprise 
administering a compound to a host, e.g., a transgenic animal 

15 that expresses a mahogany transgene or a mutant mahogany 
transgene, and measuring the level of mahogany gene 
expression, gene product expression, or gene product 
activity. The measured level is compared to the level of 
mahogany gene expression, gene product expression, or gene 

20 product activity in a host that is not exposed to the 

compound, such that if the level obtained when the host is 
exposed to the compound differs from that obtained when the 
host is not exposed to the compound, a compound that 
modulates the expression of the mammalian mahogany gene 
and/or the synthesis or activity of mammalian mahogany gene 
products, and/or the symptoms of a mammalian body weight 
disorder, such as obesity, cachexia, or anorexia, has been 
identified. 

The Example presented in Section 6, below, describes the 
genetic and physical mapping of the . mahogany gene to a 
30 specific 700 kb interval of mouse chromosome 2. The example 
presented in Section 7, below, describes the identification 
of a transcription unit within this chromosome interval, 
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referred to herein as the MG gene, which represents the 
mahogany gene. The expression and sequence analysis of this 
candidate mahogany gene is described in the example presented- 
in Section 8, below. These experiments prove that the 
candidate gene MG is indeed the mahogany gene . The example 
5 presented in Section 9, below, presents data demonstrating 
that the mechanism of mahogany action is specific for diet- 
induced obestity, therefore supporting the use of mahogany 
antagonists as a specific therapeutic for treatment of diet- 
induced, body weight disorders. The example presented in 
10 Section 10, below, presents the identification and 

characterization of the human mg gene, variants thereof and 
polypeptides encoded by the human mahogany sequences . 

- DEFINITIONS 

As used herein, the following terms shall have the 

15 

abbreviations indicated. 

BAC, bacterial artificial chromosomes 

bp, base pair(s) 

EST, expressed sequence tag 

mg, mahogany gene 

RFLP, restriction fragment length -polymorphism 
RT-PCR, reverse transcriptase PCR 

SSCP, single-stranded conformational polymorphism 
SSLP, simple sequence length polymorphisms 
STS, short tag sequence 
25 YAC, yeast artificial chromosome 

4. BRIEF DESCRIPTION OF THE FIGURES 

FIG . .1 . Physical map of the mahogany interval of mouse 
chromosome 2 . 

30 

FIG. 2. Panels A<l)-A(9) : cDNA nucleotide sequence of 

the wild- type (C57BL/6J) murine mahogany gene (SEQ ID NO: 1) , 

. - lO - 
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including the 5' and 3' untranslated regions, and Panel B: 
the derived amino acid sequence (SEQ ID NO: 2) of the 
mahogany gene product . . . V 

FIG • 3 • . Genomic structure and nucleotide sequences 
5 derived from the wild- type (C57BL/6J) mouse genomic regions 
containing the mg gene. Panel A, genomic structure; Panel 
B(l)-B(17), genomic sequence c56 (SEQ ID NO: 3); Panel C(l)- 
C(6), genomic sequence c96 (SEQ ID NO: 4); Panel D (1 ) -D (86) , 
genomic sequence of cllO/111 (SEQ ID NO: 5) . 

10 

FIG - 4 • Structural depiction of MG cDNA without 
introns. CUB=CUB domain, metal =metallothionin domain; T- 
t ransmembrane domain. 

15 FIG. 5(1) -5 (4). Nucleotide sequence of primers used 

to amplify each of the exons in the mg gene. 

FIG - 6 • Nucleotide sequence of the wild-type (SEQ ID 
NO: 6) and mahogany mutant (SEQ ID NO: 7) sequences in exon 
15 of the MG gene. Bases shown in bold are deleted in Mg3J 
mutant mg . 

FIG - 1 ' Differential 5' start sequences in the murine 
mahogany gene showing splice forms akml003 and akml0O4. 

2 5 

FIGS. 8A-E. Panels A-C, cDNA sequence (SEQ ID NO: 8) 

from one form of the differential 5' start site found in the 
murine (akml003), Panel D, amino acid sequence (SEQ ID NO: 9) 
encoded by the cDNA of Panels A-C; Panel E, hydropathy plot 
of the akml003 amino acid sequence. 



30 



FIGS. 9A-C. Panel A, cDNA sequence <SEQ ±D NO: 10) 
from one form of the differential 5' start site found in the 
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murine (akml004) ; Panel B, amino .acid sequence (SEQ- ID 

NO: 11) encoded by the cDNA of Panel A; Panel C, hydropathy 

plot of the akml004 amino acid sequence. 

FIGS. lOA^B. Nucleotide sequence (SEQ ID NO: 12) of a 
5 contig containing a portion of the human MG cDNA ; panel A(l)- 
A(7) and the translated .aminp. acid sequence (SEQ ID NO: 13), 
panel B. 

FIGS . I1A-B . Effect of mg on MC4r -/- induced weight 
10 gain in females (FIG. 11A) and males (FIG. 11S) ; values 
depicted are the mean +/- SD within a designated time 
interval . 

FIGS . 12A-D. Effect of mg on monogenic obese mutants 
15 Lepr** (FIG. 12A) , tuJb (FIG. 12B) , Cpe rat (FIG. 12C) , and on 
high fat diet .induced obesity (FIG . 12D) ; the values 
indicated are the mean +/ - SD of the weight length ratio for 
each animal . 

FIG . 13 . Genetic and physical map of the region 

20 

surrounding the mg locus; all MIT markers are presented with 
shortened names, e.g., D2MIT77 is indicated as D2M77 ; 
locations of loci which also mapped on the iiuman cytogenetic 
map are indicated in parentheses after the .gene symbol . 

Panel A . The genetic map of the mg gene region on 
25 the Millennium BSB mapping panel (Misumi, D.J. -et al . , 

1997, Science 278:135-138);. 

Panel B. The genetic map -obtained from crosses 
segregating mg mutant alleles ; 

Panel C. The -1 Mb BAC contig across the mg gene 
30 region of mouse Chromosome 2 ; 

. Panel D. The transcriptional units i<ientif ied in 
the mg region; the filled ±>ox indicates the. nig gene, 

" 1 ? " 
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whereas the hatched box is a member of the High Mobility 
Group (HMG) gene family which sits between coding exons 
21 and 22 of the mg gene. 

FIG - 14 * Northern blot analysis with C3H/HeJ (lane 1) , 
5 and three mutant alleles of mg: C3HeB/Fe J-wg 3J (Lane 2), 

LDJ/Le-jngr (Lane 3) , and C3H/HeJ-/ngr J (Lane 4); the size marker 
is shown on the left, and hybridization with actin is shown 
below for loading comparisons. 

10 FIGS • — 15A-D. In situ hybridization data: FIG. 15A 

demonstrates widespread expression o f wg throughout the mouse 
brain is seen in an antisense autoradiographic image of a 
C3H/HeJ brain at the level of the 3rd ventricle; decreased 
expression in mg mutants is documented : in selected antisense 

15 darkfield images of 10 /im whole mount cross sections of the 
ventromedial hypothalamic nucleic (VMH) of C3H/HeJ (FIG. 
15B) , LDJ/Le-/ngr (FIG. 15C) , and C3HeB/FeJ- J mg JJ (FIG. 15D) . 

FIGS. 16A-B. Alignment of the MG protein sequence with 
20 itS famil Y members showing the transmembrane region 

(indicated in brackets) and cytoplasmic tail (FIG. 16A) ; and 
a schematic of the- molecular modular architecture of MG (FIG 
16B) . 

■ FIGS • — 17A-D. Sequence alignment of the predicted MG 

25 

protein sequence (top) with the Attract in protein sequence. 
Characteristic MG domains are as indicated. See Section 10.2 
for details. 

FIG. 18A-D. Panels A (1-8) : cDNA nucleotide sequence 
30 (SEQ ID NO: 14) of the long splice variant of the human ; 
ortholog of the mahogany gene, and Panels B-D: the derived 
amino acid sequence (SEQ ID NO: 15) of the mahogany gene 
product which it encodes. 
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FIG. 19A-D. Panels A-C: cDNA nucleotide -sequence (SEQ . 
ID NO: 16) of a shorter splice variant of the human ortholog 
of the mahogany gene, and Panel D: the derived amino acid 
sequence (SEQ ID NO: 17) of the mahogany gene product which 
it encodes . 

5 

FIG. 20A-C. Panels A-B: cDNA nucleotide sequence (SEQ 
ID NO: 18) of a second shorter splice variant of the human, 
ortholog of the mahogany gene, and Panel C: the derived 
amino acid sequence" (SEQ ID NO: 19) of the mahogany -gene 
10 product which it encodes. 

5. DETAILED DESCRIPTION OF THE INVENTION 

Described herein is the identification of the novel 
mammalian mahogany (rug) gene, including the human mahogany 
25 gene, which is involved in the control of mammalian body 

weight. Also described are recombinant mammalian, including 
human mahogany DNA molecules, -cloned genes, and -degenerate 
variants thereof- The compositions of the present invention 
further include mg gene products (e.gr., proteins) that are 
encoded by the mg DNA molecules of the invent ion , and the .. 

20 

modulation of mg gene expression and/or mg gene product 
activity in the treatment of mammalian body weight disorders, 
including obesity, cachexia, and anorexia. Also described 
herein are antibodies against mg gene products (e.g., 
proteins) , or conserved variants or fragments thereof, and 

25 nucleic acid probes useful for the identification of mg gene 
mutations, and the use of such nucleic acid probes in 
diagnosing mammalian body weight disorders, including 
obesity, cachexia, and anorexia. Further described are 
methods for the use of the mg gene and/or mg gene products in 

30 the identification of compounds which raxiulate the activity 
of the mg gene product. 
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5.1- ■ THE MAHOGANY GENE 

The mahogany genes are novel mammalian genes involved in 
the control of body weight. . The nucleic acid sequences of 
the mahogany genes,, including the murine mahogany gene 
sequences shown in FIGS. 2A, 3B-D, 6A-B, 8A-C, and 9A, as 
well as allelic variants and homologs thereof , and human 
mahogany sequences, as shown, e.g. . in FIGS. 10A, 18A, 19A-C 
and 2 OA- B, as well as allelic variants and homologs thereof. 
The genomic sequence and structure, i.e., the intron/exon 
structure, of the mahogany genes have also been elucidated, 
FIG. 3 

The mahogany gene nucleic acid molecules of the present 
invention . comprise : (a) the DNA sequence shown in FIGS. 2A, 
3, 6A-B, ;8A-C, 9A, 10A, 18A, 19A-C or 20A-B, or any DNA 
sequence that encodes the amino acid sequence, of the mahogany 
gene product shown in FIGS. 2B, 8D, 9B, 10B, I7A-D, 18B-D, 
19D or 20C; (b) nucleotide sequences comprising the novel 
mahogany sequences disclosed herein that encode mutants of 
the mahogany gene product in which all or a part of one or 
more of the domains is deleted or altered, as shown, e.g. , 
FIG. 6; (c) nucleotide sequences that encode fusion proteins 
comprising a mahogany gene product, or one of its domains 
fused to a heterologous polypeptide; and (d) nucleotide 
sequences within a mahogany gene , nucleotide sequences on the 
chromosome flanking the mahogany gene, see, e.g. . FIG. 3 and 
human genomic sequences syntenic to the sequences depicted in 
FIG. 3, which can be utilized as part of the. methods of the 
invention for identifying and diagnosing individuals who 
exhibit or are susceptible to weight disorders, including 
obesity, cachexia, and anorexia. 

The mahogany nucleotide sequences of the invention 
further comprise: (a) any nucleotide sequence that 
hybridizes to the complement of a nucleic acid molecule that 
encodes a mahogany gene product under highly stringent 
conditions , e.g., hybridization to filter-bound DNA in 0.5 M 
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NaHPO«, 7% sodium dodecyl sulfate (SDS) , 1 mM EDTA at 65° C, 
and washing in 0.1xSSC/0.1% SDS at 68°C (Ausubel F.M. et al., 
eds. , 1989, Current Protocols in Molecular Biology, Vol. I, 
Green Publishing Associates, Inc. , and John Wiley & Sons, 
5 Inc., New York, at p. 2 . 10 . 3 ) part icularly human mg 

sequences, FIG. 10; and <b) any nucleotide sequence that 
hybridizes to the complement of a nucleic acid molecule that: 
encodes a mahogany gene product under less stringent 
conditions , such as moderately stringent conditions , 
e.g. , washing in 0 . 2xSSC/0 . 1% SDS at 42 °C (Ausubel et al., 

10 

1989, supra) , yet which still encodes a functionally 
equivalent mahogany gene product. 

"Functionally equivalent " , as utilized herein, ref ers to 
a gene product (e.gr., a protein) capable of -exhibiting a 
substantially similar in vivo activity as the endogenous mg 

15 gene products encoded by the mg gene sequences -described 

above. The in vivo activity of the mg gene product, as used 
herein, refers to the ability of the mg gene product, when 
present in an appropriate cell type, to ameliorate, prevent, 
or delay the appearance of the mahogany phenotype relative to 

20 its appearance when that cell type lacks a functional 
mahogany gene product . 

The invention also includes nuclei-c acid molecules, 
preferably DNA molecules, that are the implements of the 
nucleotide sequences described above. Among the nucleio acid 

2 5 molecules of the invention are deoxyoligonucleot i<les 
("oligos 11 ) which hybridize under highly stringent or 
moderately stringent conditions to the mahogany nucleic acid 
molecules described above. Exemplary highly stringent 
conditions may refer, e.g., to washing in 6xSSC/0.05% sodium 
pyrophosphate at 37°C (for 14 -base oligos) , 46 °C (for 17 -base 

30 oligos), 55°C (for 2 0 -base - oligos) , and -60°C (for 23-base 
oligos) . These nucleic acid molecules -may -encode or act as 
antisense molecules, -useful, for example, in mahogany gene 
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regulation,, and/or. as antisense primers in amplification 
reactions of mahogany gene nucleic acid sequences With 
respect to mahogany gene regulation, such techniques can be 
used to regulate, for example, weight disorders such as 
obesity, cachexia, or anorexia .." Such sequences may also be 
5 used as part of ribozyme and/or triple helix sequences, which 
are also useful for mahogany gene regulation. Still... further , 
such molecules may be used as components of diagnostic 
methods whereby, for example,, the presence of a particular 
mahogany allele associated with a weight disorder, such as 
10 obesity, cachexia, or anorexia, may be detected. Among the 
molecules which can be used for diagnostic methods, such as 
those which involve amplification" of genomic mahogany 
sequences, are primers or probes that can routinely be 
obtained using the genomic and cDNA sequences disclosed 

- herein . 
15 

In one embodiment, the nucleic acid molecules of the 
invention do not include nucleic acid molecules that consist 
solely of the nucleotide sequence that encodes the attract in 
protein sequence depicted in FIGS. 17A-D. 

The mahogany nucleic acid sequences of the invention 
further include fragments of the nucleic acid sequences 
described above. For example, mahogany nucleic acid 
fragments can include fragments of at least 10, 12, 15, 20, 
30, 40, 50, 100, 150, 200, 300, 400, 500, 600, 700, 800, 900, 
1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 

25 2000 or more nucleotides. 

The nucleotide sequences of the present invention also 
include (a) DNA -vectors that contain any of the foregoing 
mahogany coding sequences and/or their complements; (bj DNA 
expression vectors that contain any of the foregoing mahogany 

^ coding sequences operatively associated with a regulatory 
element that directs the expression of the coding sequences; 
and (c) genetically engineered host cells and organisms that 
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contain any of the foregoing mahogany coding sequences 
operatively associated with a regulatory element that -directs 
the expression of the coding sequence in the host: cell- As 
used herein, regulatory elements include, but are not limited 
5 to inducible and non- inducible prompters, enhancers, 

operators, and other elements known to those skilled in the 
art that drive and regulate gene expression. Such regulatory 
elements include, but are not limited to, the cytomegalovirus 
hCMV immediate early gene, the early or late promoters -of 

SV4 0 adenovirus,, the lac system, .the trp system, the TAC 

1° - ^ 

system, the TRC system, the major operator and promoter 

regions of phage A, the control regions of fd coat protein, 

the promoter for 3 1 -phosphoglycerate kinase, the promoters of 

acid phosphatase,, and the promoters of the yeast alpha-mating 

factors . 

15 in addition to the mahogany gene sequences described 

above, homologs of such sequences , exhibiting extensive 
homology to one or more domains of the mahogany gene product 
can be present in other species. In a preferred embodiment, 
the mahogany gene homologue maps to a chromosomal region that 

2 0 is syntenic to the chromosomal region of the mahogany gene. 
In a particularly preferred embodiment, a human mahogany gene 
homologue sequence maps to a human chromosome region tbat is 
syntenic to the region of mouse chromosome 2 to which the 
murine mahogany gene maps, namely 20plS, and comprises the 
contiged human MG cDNA provided herein. Further, there can 

25 

also exist homologue genes at other genetic loci within the 
genome of the same species which encode proteins having 
extensive homology to one or more domains of the mahogany 
gene product. Such mahogany homologs can include, for 
example, secreted forms of the mahogany sequences, see, e . , 
30 Duke -Cohan, J.S . et al . (1998, Proa. Natl. Acad. ^Sci > JJ.S.A. 
95:11336-11341) . Such sequences, can be used, for example, 
in the screening assays, described in Section "S .4 .2 below, 
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for compounds that interact- with the mahogany gene and/or its 
gene product and that may therefore be useful in treating and 
ameliorating body weight disorders. 

Other mahogany homo logs can be identified and readily 
5 isolated, without undue experimentation, by molecular 
biological techniques well known in the art, and are 
therefore within the scope of the present invention. As an 
example, in order to clone a human mahogany gene homologue 
using isolated murine mahogany gene sequences, such murine 
mahogany gene sequences may be labeled and used to screen a 

10 

cDNA library constructed from mRNA obtained from appropriate 
cells or tissues derived from the organism (in this case, 
human) of interest. With respect to the cloning of such a 
human mahogany homologue, a human cDNA library may, for 
example be used for screening, such as a cDNA library 

15 obtained from mRNA isolated from brain tissues, particularly 
containing hypothalamic regions. 

The hybridization washing conditions used should be of a 
lower stringency when the cDNA library is derived from an 
organism different from the type of organism from which the 

20 labeled sequence was derived. With respect to the cloning of 
a human mahogany homologue, for example, hybridization can be 
performed for 4 hours at 65 °C using Amersham Rapid Hyb™ 
buffer (Cat. #RPN1639) according to manufacturer's protocol, 
followed by washing, with a final washing stringency of 
1.0xSSC/0.1% SDS at 50°C for 20 minutes being preferred. 

Low stringency conditions are well known to those of 
skill in the art, and will vary predictably depending on the 
specific organisms from which the library and the labeled 
sequences are derived. For guidance regarding such 
conditions see, for example, Sambrook et al . , 1989, Molecular 

30 Cloning, A Laboratory Manual , Cold Springs Harbor Press, 
N. Y . ; and Ausubel et al \ , 1989, Current Protocols in 
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Molecular Biology, Green Publishing Associates and Wiley 
Interscience, N.Y. 

Alternatively, the labeled fragment may be used to 
screen a genomic library derived from the organism of 
5 interest, again, using appropriately stringent conditions. 

Further, a mahogany gene homologue may be isolated from 
nucleic acid of the organism of interest by performing PCR 
using two degenerate oligonucleotide primer pools designed on 
the basis of amino acid sequences within the mahogany gene 
product disclosed herein. The template for the reaction -may 

10 

be cDNA obtained by reverse transcription of mRNA prepared 
from, for example, human or non-human cell lines or tissue 
known or suspected to express a mahogany gene allele. 

The PCR product may be subcloned and sequenced to ensure 
that the amplified sequences represent the sequences of a 

15 mahogany gene nucleic acid sequence. The PCR fragment may 

then be used to isolate a full length cDNA clone by a variety 
of methods. For example, the amplified fragment may be 
labeled and used to screen a cDNA library, such as a 
bacteriophage cDNA library. Alternatively, the labeled 

2o fragment may be used to isolate genomic -clones' via the 

screening of a genomic library. This method has been used to 
isolate sequences encoding each of the murine MG gene exons 
as well as to isolate contigs containing the human MG 
sequences provided herein, FIG. 10. 

PCR technology may also be utilised -to isolate full 

25 

length cDNA sequences. For example, RNA may be isolated, 
following standard procedures, from an appropriate cellular 
or tissue source (i.e. , one known, or suspected, to express 
the mahogany gene) . A reverse transcription reaction -may :be 
performed on the RNA using an oligonucleotide primer specific 
30 for the most 5' end of the amplified fragment for ti*e priming 
of the first strand synthesis. The r-esulting RNA/DNA hybrid 
may then be "tailed" with guanines using a -standard terminal 
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H, and second strand synthesis may then be primed with a 
poly- C primer. Thus, cDNA sequences upstream of the 
ampUfaed fragment may easily be isoaated. For a review of 
5 cloning strategies which may be used, see e.g. . sambrook et 

al. , 1989 supra. 

Mahogany gene sequences may additionally be used to 
isolate mutant mahogany alleles . Such mutant alleles may be 
isolated from individuals either known or proposed to have a 
10 ~vpe "hich contributes to the symptoms of body weight 
disorders, such as obesity, cachexia, or anorexia or disorders 
associated with hyperphagia. Mutant alleles and mutant 
allele products may then be utilized in the therapeutic and 
diagnostic systems described below. Additionally, such 
mahogany gene sequences can be used to detect mahogany gene 
regulatory (e.g. promoter, defects which can affect body 
weight . y 

A cDNA of a mutant mahogany gene may be isolated, for 
example, by using PC*, a technique which is well known to 
those of skill in the art. m this case, the first cDMA 
20 strand may be synthesized by hybridizing an oligo-dT 
oligonucleotide to mRNA isolated from tissue known or 
suspected to be expressed in an individual putatively 

s^riT mUtant mah ° 9anY a " ele - ty lending the new 

strand wxth reverse transcriptase. The second strand of the 
25 CDNA x. then synthesized using an oligonucleotide that 
hybridizes specifically to the 5 . end of the normal gene 
Using these two primers, the product is then ampiified via 
PCR. cloned into a suitable vector, and subjected to DMA 
sequence analysis through methods well known to those of 
^ skill in the art. By comparing the DNA sequence of the 

mutant mahogany allele to that of the normal mahogany allele 
the station (s, responsible for the loss of alteration of ' 
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activity of the mutant mahogany gene product .can be 
ascertained. 

Alternatively, a genomic library can be constructed 
using DNA obtained from an individual suspected of or known 
5 to carry the mutant mahogany allele, or a oDNA library can be 
constructed using RNA from a tissue known, or suspected to 
express the mutant mahogany allele. The normal mahogany gene 
or any suitable fragment thereof may then be labelled and used 
as a probe to identify the corresponding mutant mahogany 
allele is such libraries. Clones containing the mutant 

10 

mahogany gene sequences may then be purified and subjected to 
sequence analysis according to methods well known to those of 
skill in the art . 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated 

!5 from a tissue known, or suspected to express a mutant 

mahogany allele in an individual suspected of or known to 
carry such a mutant allele. In this manner, gene products 
made by the putatively mutant tissue may be expressed and 
screened using standard antibody screening techniques in 

2q conjunction with antibodies raised against the normal 

mahogany gene product as described, below, in Section 5.3. 
For screening techniques, see, for example, Harlow, E. and 
Lane, eds . , 1988, "Antibodies: A Laboratory Manual", <:old 
Spring Harbor Press, Cold Spring Harbor. In cases where a 
mahogany mutation results in an expressed gene product with 
altered function (e.g., as a result of a missense or a 
frameshift mutation) a polyclonal set of ant i -mahogany gene 
product antibodies are likely to cross -react with the mutant 
mahogany gene product. Library clones detected via their 
reaction with such labeled antibodies can be purified and 

30 subjected to sequence analysis according to methods well 
known to those of skill in the art. 
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5 - 2 - PROTEIN PRODUCTS OF THE MAHOGANY GENE 

Mahogany gene products (e.g., proteins), polypeptides 

and peptide fragments, mutant, truncated, or deleted forms of 

the mahogany gene product , and/or fusion proteins of the 

mahogany gene product can. be prepared for a variety of uses 
5 ••' 
For example, such gene products, or peptide fragments. 

thereof, can be used for the generation of antibodies in 
diagnostic assays, or for the identification of other/ 
cellular or extracellular products involved in the regulation 
of mammalian body weight. 
10. Mahogany gene products, also referred to herein as 

mahogany proteins, of the present invention include those 
gene products encoded by the mahogany gene sequences 
described in Section 5.1, above. For example, FIG. 2B, 8D 
and 9B depict murine mahogany amino acid sequences. Mahogany 
i5 gene products also include human mahogany gene products as 
shown, e^cu, in FIGS. 1033, 17A-D, 18B-D, 19D, and 20C. 

In addition, mahogany gene products may include proteins 
that represent functionally equivalent gene products. Such 
an equivalent mahogany gene product may contain deletions, 
including internal deletions, additions, including additions 
yielding fusion proteins, or substitutions of amino acid 
residues within and/or adjacent to the amino acid sequence 
encoded by the mahogany gene sequences described, in Section 
5.1, above, but that result in a "silent" change, in that the 
change produces a functionally equivalent mahogany gene 
25 product. Such amino acid substitutions may be made on the 
basis of similarity in polarity, charge,, solubility, 
hydrophobic! t.y, hydrophilicity , and/or the amphipathie nature 
of the residues involved. For example, nonpolar 
(hydrophobic) amino acids include alanine, leucine, 
3Q isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine; polar neutral amino acids include glycine, 
serine, threonine, cysteine, tyrosine, asparagine, and 
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glutamine; positively charged (basic) amino acids include 

arginine, lysine, and histidine; and negatively charged 

(acidic) amino acids include aspartic acid and glutamic acid. 

"Functionally equivalent " , as utilized herein, refers to 

5 a gene product (e.g., a protein) capable of exhibiting a 

substantially similar in vivo activity as the endogenous mg 

gene products encoded by the mg gene sequences -described i-n 

Section 5.1, above. The in vivo activity of the /ng gene 

product, as used herein, refers to the ability of the mg gene 

product, when present in an appropriate cell type, to 
10 ' . 

ameliorate, prevent, or delay the appearance of the mahogany 

phenotype relative to its appearance when that cell type 

lacks a functional mahogany gene product. 

Alternatively, where alteration of function is desired, 

deletion or non-conservative alterations can produce altered, 

15 including reduced-activity, mahogany gene products. Such 
alterations can, for example, alter one or more of the - 
biological functions of the mahogany gene product . Further, 
such alterations can be selected so a^s to generate mahbgany 
gene products that are better suited for expression, scale 

2d up, etc. in the host cells chosen. For example, cysteine 
residues can be deleted or substituted with another- amino 
acid residue in order to eliminate disulfide bridges. 

As another example, altered mahogany gene products can 
be engineered that correspond to mutants or variants of the 
mahogany gene product associated with mammalian weight 
disorders, such as obesity, cachexia, or anorexia . Altered 
mahogany gene products can also be engineered that correspond 
to mutants or variants of the mahogany gene product known to 
neutralize or ameliorate the symptoms of body weight 
disorders, such as obesity, cachexia/ or anorexia, which are 

30 mediated by some other gene, including, but not limited to, 
body weight disorders mediated by the agouti gene. 
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Also within the scope of the present invention are 
peptides and/or proteins corresponding to' one or more domains 
of the mahogany protein or any one of the individual exon 
encoded regions of the MG protein, as well as fusion proteins 
_ in which the full length mahogany protein, a mahogany . 
5 peptide, or a truncated mahogany protein or peptide is fused 
to an unrelated heterologous protein. Such proteins and 
peptides can be designed on the basis of the mahogany - ' 

nucleotide sequence disclosed in Section 5.1, above, and/or 
on the basis of the mahogany amino acid sequence disclosed in 
10 th e Section. 

the mahogany gene products of the invention further 
include .fragments of the gene products described herein. For 
example, mahogany gene product fragments can include 
fragments of at least 10, 12, 15, 20, 30, 40, 50, 100, 150, 
is 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 
1300 or more amino acids in length. 

In one embodiment, it is understood that the gene 
products of the present invention do not include a gene 
product that consists solely of the amino acid sequence of 
the attractin polypeptide depicted in FIGS. 17A-D. 

20 Fusion proteins of the invention include, but are not 

limited to, IgFc fusions which stabilize the mahogany protein 
.or peptide and prolong half life in vivo; or fusions to any : 
amino acid sequence that allows the fusion protein to be 
anchored to the cell membrane; or fusions to an enzyme, 

25 fluorescent protein, or luminescent protein which provides a 
marker function. 

The mahogany gene products, peptide fragments thereof 
and fusion proteins thereof, may be produced by recombinant 
DNA technology using techniques well known in the art. Thus, 
30 methods for preparing the. mahogany gene products, 
polypeptides, peptides, fusion peptide and fusion 
polypeptides of the invention by expressing nucleic acid 
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containing mahogany gene sequences are described herein. 
Methods that are well known to those skilled in the art can 
be used to construct expression vectors containing mahogany 
gene product coding sequences and appropriate transcriptional 
5 and translat ional control signals. These methods include/ 
for example, in vitro recombinant DNA techniques, synthetic 
techniques, and in vivo genetic recombination^ "See, for 
example, the techniques described in Sambrook, et al . , 1989, 
supra, and Ausubel, et al . , 1989, supra. Alternatively, RNA 
capable of encoding mahogany gene product sequences may be 
chemically synthesized using, for example, synthesizers. 
See, for example, the techniques described in 
"Oligonucleotide Synthesis", 1984, Gait, ed., IRL Press, 
Oxford. 

A variety of host -expression vector systems may be 
15 utilized to express the mahogany gene product coding 

sequences of the invention. Such host -expression systems 
represent vehicles by which the coding sequences of interest 
may be produced and subsequently purified, but also represent 
cells that may, when transformed or transacted with the 

20 appropriate nucleotide coding sequences, exhibit the mahogany 
gene product of the invention in situ. These include *but are 
not limited to microorganisms such as bacteria (e.g., E. 
coli, B. sxibtilis) transformed with recombinant bacteriophage 
DNA, plasmid DNA or cosmid DNA expression 'vectors containing 

25 mahogany gene product coding sequences; yeast -(e.-g., 

Saccharomyces , Pichia) transformed with recombinant yeast 
expression vectors containing the mahogany gene product 
coding sequences; insect cell systems infected, witb 
recombinant virus expression vectors (-e.g. , baculovirus) 

30 containin 9 the mah 09*ny gene product coding sequences; plant 
cell systems infected with recombinant virus expression 
vectors (e.g., cauliflower mosaic virus, <TaMV; tobacco niosaic 
virus, TMV) or transformed with recombinant plasmid -expces- 
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s.on vector, Ti plasmid, containing matlogany . gene 

product coding sequences; or mammalian cell system (e , 
COS. CHO. BHK. „ 3 . 3T 3, harboring recombinanl exp e I^' 
constructs containing probers derived £ro , the ge.ome of 

5 mammalian cells (e.q metallot-^ 

i metallothionem promoter) or from 

mammalian viruses f*=> rr j 

uses (e.g., the adenovirus late promoter; the 
vaccinia virus 7.5K promoter). 

_ in bacterial systems, a number of expression vectors may 

be advantageously seleri-Pr! ^ • ay 

for th P nJ/' selec ted depending upon the use- intended 

10 T the i mah09any 9Sne Product being expressed. For example 
when a large quantity of such a protein is to h „ 
fnr ... ^ Pfocein is to be produced, 

for the generation of pharmaceutical compositions of mahogany 
gene product or for raising antibodies to mahogany gene 
product, for example, vector, that direct the exp^Lon of 
hl9h , 1 7 1 ' ° £ fUSi0 " P**«*. that are readily 

Pur, f red may be desirable. Such vectors include, but are not 

;l t0 " ~" eXPreSSi °" PUR278 (Huther et 

pro u ct coding sequence may be ligated individualize the 

vector m frame with the lar 7 ■ 

,0 protein is produced.- pI „ ^^J 91 ™ ~ * ^ 
KT . . y vectors (Inouye and Inouye, 1985 

^ITlt' " ' 310 —- V -Hee k e and Schuster,' 
l<>1 - ,Chem - SM ' SM3 - SS »?" — the lifce. pGEX 
vec to may alsp be used to express foreign polypeptides as 
fusron proteins with glutathione S-transferase WST, In 

" T" f fUSi ° n Pr ° teinS soluble «* can easily be 

purrfrea from lysed cells by adsorption to glutathione- 

glutaT £0ll ° Wed ^ elUCi °" in «» P-sence of free 

glutathione. The p GE x vectors are designed to include 
thrombin or factor Xa protease cleavage sites so that the 

^ILX'" 9 " ^ product can 

a ". inSeCt SyStem ' *"°9«P** caiiro^ica, nuclear 
Pdyhidrosre virus (AcMPV)* is used as a vector to express 
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foreign genes. The virus grows in Spodoptera frugiperda 
cells. The mahogany gene product coding sequence maybe 
cloned individually into non-essential regions (for -example 
the polyhedrin gene) of the virus and placed under control of 
5 an AcNPV promoter (for example the polyhedrin promoter) . 

Successful insertion of mahogany gene product coding sequence 
will result in inactivation of the polyhedrin -gene and 
production of non-occluded recombinant virus (i.e., virus 
lacking the proteinaceous coat coded for by the polyhedrin 
io gene) . These recombinant viruses are then used to infect 
Spodoptera frugiperda cells in which the inserted gene is 
expressed. (e.g., see Smith, et aJ., 1983, J. Virol. 46, 
584; Smith, U.S. Patent No. 4,215,051). 

In mammalian host cells, a number of viral-based 
expression systems may be utilized., in cases where ah 
15 adenovirus is used as an expression vector, the mahogany gene 
product coding sequence of interest may be ligated to.an 
adenovirus transcription/translation control complex, e.g. , 
the late promoter and tripartite leader sequence. This 
chimeric gene may then be inserted in the adenovirus genome 
20 by in vitro or in vivo recombination. Insertion in a non- 
essential region of the viral genome (e.g., region El or E3) 
will result in a recombinant virus that is viable and* capable 
of expressing mahogany gene product in infected hosts, 
(e.g., See Logan and Shenk, 1984, Proc. Natl. Acad. Sci. -USA 
25 . 81, 3655-3659) . Specific initiation signals may also be 

required for efficient translation of inserted mahogany gene 
product coding sequences . These signal s include the ATG 
initiation codon and adjacent sequences. In -cases where an 
entire mahogany gene, including its own initiation codon and 
^ adjacent sequences, is inserted into the appropriate 
expression vector, no additional translational -control 
signals may be needed. However, in cases where only a 
portion of the mahogany gene -coding -sequence is inserted, 
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init^ : C ° mUSt ^ Pr ° Vided - ^"hermore, the 

the desired coding science to- ensure translation of the 
5 entire insert. These exogenous translational control signals 

natural and synthetic.. The efficiency of expression m k 
enhanced by the inclusion of appropriate transT^ 
enhancer elements, transcription terminators, etc. .see 
10 Bxttner, eta... 1987. Methods in E „ zymoi: 153> 5 „. M ' . 
in addition, a host cell strain may be chosen that ' 
-du ates the expression of the inserted seances, or 
modifies and processes the gene product in the specific 
fashion desired. Such .edifications (e.g.. glycosylate, 
and processing <e. g .. oleavage, of protein products may be 

e ra rr: ha nt £o : the funccion ° f the — 

cells ,a ve characteristic and specific mechanisms for the 
post-translational processing and edification of proteins 
and gene products. Appropriate cell lines or host sy sZl 
can be chosen to ensure the correct edification and 
20 Processing of the foreign protein expressed. To this end 
eukaryotic host cells that possess the cellular „ h ' 
P-per processing of the primary t^ ^TJ" 
and phosphorylation of the gene product Zy b! HJ' 

cr 1 ** t, cells inciude but ~ d Lcho 

25 BHK, HeLa, COS. MDCK. 293, 3T3, andWI38 

For long-term, high-yield production of recombinant 
Proteus, stable expression is preferred. Por example ce ll 

lines that stably exDresc n,. v. ¥ ' cel1 

en(5 ,_ „ Y express the mahogany gene product may be 

engineered . Rather than using expression vectors that 

»° "IT Vi : Sl ° rl9inS bf host cells can be 

transf d Wlth DNft controUed by appropriate expression 
control elements , promoter, enhancer. seaueLes. 

transcrrpt.on terminators, polyadenylation sites, etc.,, and 
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a selectable marker. Following the introduction of the 
foreign DNA, engineered cells may be allowed to grow for 1-2 
days in an enriched media, and then are switched to a 
selective media. The selectable marker in the recombinant 
5 plasmid confers resistance to the selection and allows cells 
to stably integrate the plasmid into their chromosomes and 
grow to form foci that in turn can be cloned and expanded 
into cell lines. This method may advantageously be used to 
engineer cell lines that express the mahogany gene product . 
iq Such engineered cell lines may be particularly useful in 
screening and evaluation of compounds that affect the 
endogenous activity of the mahogany gene product. 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase 
(Wigler, et a J . , 1977, Cell 11, 223), hypoxanthine -guanine 
!5 phosphoribosyltransferase (Szybalska and Szybalski, 1962, 
Proc. Natl. Acad. Sci. USA 48, 2026), and adenine 
phosphoribosyltransferase (Lowy, et a 1 . , 198-0, -Cell 22, 817) 
genes can be employed in tk\ hgprf or aprf cells, 
respectively. Also, antimetabolite resistance can be used as 
20 the basis of selection for the following genes: <3hfr, which 
confers resistance to methotrexate (Wigler, et ah, 19SO, 
Natl. Acad. Sci. USA 77, 3567; O'Hare, et a]. , 1981, Proc. 
Natl. Acad. Sci. USA 78, 1527); gpt, whioh oonfers resistance 
to mycophenolic acid (Mulligan and Berg, 1981, Proc. Natl. 
25 Acad. Sci. USA 78, 2072); neo, which confers resistance to 
the aminoglycoside <3- 4 18 (Colberre-Garapin, et al . , 1981, j. 
Mol. Biol. 15 0, 1); and hygro, which confers resistance to 
hygromycin (Santerre, et al . , 1984, Gene 30, 147). 

Alternatively, the expression characteristic of an 
^ endogenous mahogany gene within a cell line or microorganism 
my be modified by inserting a heterologous .DNA -regulatory 
element into the genome of a stable -cell line or -cloned 
microorganism such that the inserted -regulatory element is 
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15 

e.g 



operatively linked with the endogenous » hoga n y gene. Per 
example,. an endogenous mahogany gene which is normally 
"transcriptionally silent", i.e., a mahogany gene which is 
normally not expressed, or is: expressed only a very low 
5 level, in * cell line or microorganism, may be activated by 
inserting a regulatory element which is capable of promoting 
he expression of a normally expressed gene product in t J* 
cell lane or microorganism. Alternatively a 
transcriptionally silent, endogenous mahogany bene may be 

" th ^ inEerti ° n ° f • P^.cuou. regulatory element 

that works across cell types. 

stable c h : i C i e 7 1P90US -y b.. inserted into a 

stable cell l lne or cloned microorganism, such it is 
operat. ively u ked wlth .„ endogeno>)s mahogany gen 

r «T k "' " " r9eted h0mOl ^°- -combination, which 

are weil known- to those of skill i„ the art> and deecribed 

Its ZTT' U ' S ' PBtent N °- «.*».0S1, VS. Patent N o. 

IcrZsll LT " al - ; Int «-"°-l Application no. 

PCT/US92/09627 (W093 /09222) by Selden et al • and 

International Application „o. PCT/US90/06436 ' (WO91/06667, by 
20 Skoultcha et al., each of which is incorporated by reference 
herein in its entirety. e 

Alternatively, any fusion protein may be readily 
purafied^by. utilizing an antibody specific for the fusion 
protean being expressed. For example, a system, described by 

" denat ^ ^ '"' ^ ^ Purification of non- 

denatured fusaon proteins expressed in human cell lanes 
Wanknecht. et al . , Proc . Natl . Aca(J 8ci; , 8e 

e 9 ,2-» 976) . In thiE system, the gene of interest !s 
subc oned into a vaccinia recombination plasmid such that the 
„ " ^ ^ adln9 is translationally fused to an 

amano -terminal tag consisting of six histidine residues. 
Extracts from cells infected with recombinant vaccinia virus 
are loaded onto Ni- -nitriloacetic ac ld -agarose columns and 
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histidine-tagged proteins are selectively eluted with 
imidazole-containing buffers. 

The mahogany gene products can also be expressed in 
transgenic animals. Animals of any species, including, but 
5 not limited to, mice, rats, rabbits, guinea pigs, pigs, 
micro-pigs, goats, sheep, and non-human primates, -e.g., 
baboons, monkeys, and chimpanzees may be used to generate 
mahogany transgenic animals. The term "transgenic," as used 
herein, refers to animals expressing mahogany .gene, sequences 
iQ from a different species (e.g., mice expressing human 

mahogany gene sequences) , as well as animals that have been 
genetically engineered to over express endogenous (i .e. , same 
species) mahogany sequences or animals that -have been 
genetically engineered to no longer express endogenous 
mahogany gene sequences (i.e., "knock-out" animals), and 
their progeny. 

Any technique known in the art may fie used to introduce 
a mahogany gene transgene into. animals to produce the founder 
lines of transgenic animals. Such techniques include, but 
are not limited to pronuclear microinjection (Hoppe and 
20 Wagner, 1989, U.S. Pat. No. 4,873,191); retrovirus mediated 
gene transfer into germ lines (Van der Putten, -et al . , 1985, 
Proc. Natl. Acad. Sci . , USA 82, 6148-6152); gene -targeting In 
embryonic stem cells (Thompson, et al . , 1989, Cell S€, 313- 
321); electroporation of embryos (Lo, 1983, Mol . Cell. Biol . 
25 3, 1803-1814); and sperm- mediated gene transfer (lavitrano et 
al., 1989, Cell 57, 717-723) (For a review of such 
techniques, see Gordon, 1989, Transgenic Animals, Intl. Rev. 
Cytol . 115, 171-229) 

Any technique known in the art may be used to produce 
3q transgenic animal clones containing a mahogany transgene, for 
example, nuclear transfer into enucleated oocytes of nuclei 
from cultured embryonic, fetal or adult <:ells induced to 
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quiescency (Campbell, et al not m». ,„ 

ax., i996„ Nature 380, 64-66- 

Wilmut, et al.. Nature 365, 810-813). ' 

that EreSent inVen " 0n prOVides «* transgenic animals 

that carry a mahogany transgene in all their cells, as well 
5. as annals that carry the transgene in some, but not al! 
their cells, i.e.. mosaic animals. The transgene may be 
integrated as a single transgene or in concatamers. 
head- to-head tandems or head-to-tail tandems. The transgene 
may also be selectively introduced into and activated in a 

'• r/f ~" ^ ty f0 »^' Sample, the teaching 
of Lasko et al. < L asko. et al., 1992 , Proc . Natl . ^ 9 

. ^8 9 . «a 2 -„3„ . The regulatory sequences required^ 
such a ceil-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to 
those of skill in the art. when it is desired that the ! 
mahogany transgene be integrated into the chromosomal site of 
the endogenous mahogany gene, gene targeting is preferred. 
Briefly, when such a technique is to be utilized, vectors 
containing some nucleotide sequences homologous to the 
endogenous mahogany gene are designed for the purpose of 
20 integrating, via homologous recombination with chromosomal 
sequences, into and disrupting the function of the nucleotide 

sequence of the endogenous mahogany gene Th» . , 
-•,_„. , 3 y 9 en e- The transgene may 

also be selectively introduced into . particular cell type 
thus inactivating the endogenous mahogany gene in only Zal 
25 cell type, by following, for example, the teaching of Gu et 
ai. (Gu. etal., 1994 . Sci^e. 2<$ . l0 3-106> . The regulatory 
sequences ired £or such a ceU cype ^ ina J iv J 

Will depend upon the particular cell type of interest, and 
will be apparent to those of skill in the art 

30 °" Ce tranS9eniC ani ™ ls have been generated, the 

expression of the recombinant mahogany gene may be assayed 
.util 121 „g standard techniques. Initial screening may be ' 
accomplished by Southern blot analysis or P CR techniques to 
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analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of mRNA .expression of 
the transgene in the tissues of the transgenic animals may 
also be assessed using techniques that include but are not 
5 limited to Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and RT-PCR 
(reverse transcriptase PCR) . Samples of mahogany gene- 
expressing tissue, may also be evaluated immunocytochemically 
using antibodies specific for the mahogany transgene product. 

10 

5 - 3 • ANTIBODIES TO MA HOGANY HFNE PROniTPTg 

Described herein are methods for the production of 
antibodies capable of specifically recognizing one or more mg 
gene product epitopes, or epitopes of conserved variants, or 
peptide fragments of the mg gene products. Further. 
15 antibodies that specifically recognize mutant forms of mg 
gene products, are encompassed by the invention. 

Such antibodies may include, but are not limited to, 
polyclonal antibodies, monoclonal antibodies <mAbs) , 
humanized or chimeric antibodies, single chain antibodies, 
20 Fab fragments, F(ab'), fragments, fragments produced by a Fab 
expression library, anti- idiotypic (anti-Id) antibodies, and 
epitope-binding fragments of any of the above . Such 
antibodies may be used, for example, in the detection of a mg 
gene product in an biological sample and may, therefore, be 
25 utilized as part of a diagnostic or prognostic technique 

whereby patients may be tested for abnormal levels of mg gene 
products, and/or for the presence of abnormal forms of such 
gene products. Such antibodies may also be utilized in 
conjunction with, for example, compound screening schemes, as 
^ described, below, in Section 5.4.2, for the evaluation of the 
effect of test compounds on mg gene product levels and/or 
activity. Additionally, such antibodies can .be used in 
conjunction with the gene therapy techniques described. 
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bel °*: ln SeCti0 " = - to. for example, eval Mte the 

normal and/or engineered ma hogany-expressing h * 
their, introduction into the patient. 

in S-t"" 9enC Pr ° dUCt antlb ° dies ~y ^itionally be used 

5 Thus" h S lnhibi " n9 abn ° rmal ^ ^- Product .ctivit" 

Thus, such antibodies may, therefore, be utilis.H 

weight disorder treatment methods. *" ° f 

For the production of antibodies against a m 9 ge ne 
Product, various host animals may be immunized by inaction- 
" W " h * W ' 9 *~ Pr ° dU «- P°"ion thereof. Such Lst 

animals may include, but are not limited to rabbits mice 
and rats, to name but a few. Various adjuvants may be used 
to increase the immunological response, depending on the host 
species, including but not limited to rreund- (complete anl 
™ete,. mineral gels such as aluminum hydroxide surface 
active substances such as lysolecithin „i " " • e : 

-i .y^ecicftin, pluromc polyols 

Polyan.ons, peptides, oil emulsions, Keyhole limpet 

adjuvants such as BCG <bacille Calmette-Guerin) and 

t-orynebacterium parvum. 

20 Polyclonal antibodi^c? = 

ntiooaies are heterogeneous populations of 
antibody molecules derived from the ser, of , 
with an antigen, such as a mg g e n 7 ZJLl * 

functional derivative thereof T V an " 9enic 

, " thereof. For the production of 

polyclonal antibodies, host animals such as those described 
25 above, may be immunized by . injection with m* gene produT 
suppiemented with adjuvants as a!so described above 

o f J^TIT^Z^T homo ~ populations 

:r i that ^ t :r^s - - 

3o molecules by continuous cell l ines in culture TheS e 

include, but are not limi ted t o. the hybridpma techn U of 
Kohler and Milstein, (1975, Nature 256, 495-497. anrt Tl 

Patent N 6. 4.37 6 . 110) . the human B-ce! hybridLT 1 

^ « nycridoma technique 
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(Kosbor et al . , 1983, Immunology Today 4, 72; Cole et al . , 
1983, Proc. Natl. Acad. Sci . USA 80, 2O26-2030). and the -EBV- 
hybridoma technique (Cole et al . , i 98 5, Monoclonal Antibodies 
And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Such 
5 antibodies may be of any immunoglobulin class including igG, 
IgM, IgE. IgA, IgD and any subclass thereof. The hybridoma ' 
producing the mAb of -this invention may be cultivated in 
vitro or in vivo. Production of high titers of mAbs in vivo 
makes this the presently preferred method of production. 
10 In addition, techniques developed for the production of 

"chimeric antibodies" (Morrison, et al . , 1984, Proc. Natl. 
Acad. Sci., 81, 6851-6855; Neuberger, et al . . 1984, Nature 
312, 604-608; Takeda, et al . , 1985, -Nature, 314, 452-454) by 
splicing the genes from a mouse antibody molecule of 
is appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity 
can be used. A chimeri-c antibody is a molecule in which 
different portions are derived from different animal species, 
such as those having a variable region derived from a murine 
mAb and a human immunoglobulin constant region. (See, e.g., 
20 Cabilly et al . , U.S. Patent No. 4,816,567; and Boss et al . , ' 
U.S. Patent No. 4,816397, which are incorporated herein by' 
reference in their entirety.) 

In addition, techniques have been <ieveloped for the 
production of humanized antibodies. (See, e.g., <3ueen, U.S. 
25 Patent No. 5,585,089, which is incorporated herein by 

reference in its entirety.) An immunoglobulin light or heavy 
chain variable region consists of a "framework" region 
interrupted by three hyperyariable regions, referred to as 
complementary determining regions «2DRs) . The extent of 
3q the framework region and CDRs have been precisely defined 
(see, "Sequences of Proteins of Immunological Interest", 
Kabat, E. et al . , U.S. Department of -Health a-nd Human 
Services (1983) . Briefly, humanized antibodies are antioody 
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molecules from non-human species havin 

the non-human species and a , ^ ^ CDRS fr ° m 

i-unog Wi» molecule ^ ^ ^ * *»» » 

Alternatively, techniques described for^h 
S chain .ntibodies (u . s . patent - :rs -™ 

' 3B/ y-5883; and Ward, et si , QOO 
334, 544-546) can h P a H ^ ' ' ' " aJ w 1989, Nature 

resulting • 9 ^ 3n am ^ n ° aci * bridge 

result.ng an a single chain polypeptide 

Antibody fragments that recognize cr , 
>e generated by known techni ^Z ~V 

ls .! r,9n,n " include buc ~ - "-- d t t T 

fragments, which can be prodded by pepsin dL ^ 
antibody molecule and the Fab f r dl 9estion of the 

derated by reducing the T^T^L^ 
fragments, alternatively Fab e*n ^ "'^'^ 

constructed <Huse. et al ^ 

20 allow rapid and iden i flca ; ±on 7" " 7 ^"»> *> 

^u C u L i Ucatlon Qf monoclonal t^k 

fragments with the desired specificity. ^"V** 

5 ' 4 ' ■■.^asissssk^ 

Peptide fragments thereof Ld of , 1 ' inCludin 9 

-hogany gene products and peptide f ' 
V applications indude. for eLmle SUCh 

evaluation of body weight ^ L^TV" 
30 subjects with a predisposition to such di ^"^^ «* 
; described beldw, i„ section S 4 1 "lit ' 

applications inciude methods tl ,^^' ^ 

ror the treatment of body weight 
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and body weight disorders, as described, -below, in Section 
5.4.2, and for the identification of compounds which modulate 
the expression of the mahogany gene and/or the activity of 
the mahogany gene product, as described in Section S . 4 . 3, 
5 below. Such compounds can include, for example, other 
cellular products which are involved in body weight 
regulation. These compounds can be used, for example, in the 
amelioration of body weight disorders, including obesity, 
cachexia, and anorexia. 

iQ While, for clarity, the uses described in -this section 

are primarily uses related to body weight disorder 
abnormalities, it is to be noted that each of the diagnostic 
and therapeutic treatments described herein can additionally 
be utilized in connection with other defects associated with 
the mahogany gene, such as hyperpigmentation, hyperphagia and 

15 other disorders resulting in increased metabolic rates. 

5.4.1. DIAGNOSIS OF BODY WEIGHT 
DISORDER ABNORMALITTEfi 

A variety of methods can be employed for the diagnostic 
20 and prognostic evaluation of body weight disorders, including 
obesity, cachexia, and anorexia, and for the identification 
of subjects having a predisposition to such disorders. 

Such methods may, for example, utilize reagents such as 
the mahogany gene nucleotide sequences described in Section 
^ 5.1, and antibodies directed against mahogany gene products, 
including peptide fragments thereof, as described, above, in 
Section 5.3. Specifically, such reagents may be used, for 
example, for: 

(1) the detection of the presence of tnahogany gene 
mutations, or the detection of either over- or under- 
30 expression of mahogany gene relative to levels of mahogany 
expression in a wild-type, non -body weight disorder state 
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which correlates with certain body weight disorders or 
susceptibility toward such body weight disorders- 

(2) the detection of over- or under-abundance of 
mahogany gene product relative 1-0 t->,~ v, 

uve to the abundance of mahoaanv 
5 gene product in a- wild-type non h^. • ^ mahogany 
■ W non-body weight disorder state 

whxch correlates with certain body weight disorders or 
susceptibility tpward such body weight disorders; and 

(3) the detection of an aberrsnt- r» -> 
product activity relative r ""^ leVel of -"°9any gene 
; tlv "y "lat?ve to mahogany gene product activity 

1. leVel f 1D ' "en-body weight disorder state „h ch 

correlates with certain body weight disorders or 
susceptibility toward such body weight disorders 

Mahogany gene nucleotide sequences can, for example be 
used to diagnose a body weight disorder using. f or 

the techniques for detecting mutations in Vh. u 
15 i. ^ Aons ln tn e mahogany aene 

described above in Section 5.1, above. 

• The ™ eCh ° ds Ascribed herein may be performed, for 

example, by utilizing pre-oackaoeri rf< , 

at Y _,.„„. packaged diagnostic kits comprising 

at least one specific mahogany gene nucleic acid or anti- 
mahogany gene product antibody reagent described herein 
20 »hich may be conveniently used. in clinical 

to screen and diagnose patients exhibiting body weight 
disorder abnormalities, and to screen those individuals 
exhibiting a predisposition to deveioping a body weight 
disorder abnormality. 

25 ^ For the detection of mahogany gene mutations, any 

nucleated cell can bs used as a starting source for genomic 

oTman! ^ ^ °* — 

or mahogany gene products, any cell type or tissue in which ' 

Larn^le 9 : ny ^ 1 \ < "*— ~ ~* utili.ed. such as. for 
30 gene " ~ «V ^ ^ the MG 
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Nucleic acid-based detection techniques are described, 
below, in Section 5.4.1.1. Peptide detection techniques are 
described, below, in Section 5.4.1.2. 

5 5.4.1.1. DETECTION OF MAHOGANY GENE NUCLEIC 

ACID MOLECULES 

Mutations or polymorphisms within the mahogany gene can 
be detected by utilizing a number of techniques. Nucleic 
acid from any nucleated cell can be used as the starting 
10 point for such assay techniques, and may be isolated 

according to standard nucleic acid preparation procedures 
which are well known to those of skill in the art. 

Genomic DNA may be used in hybridization or 
amplification assays of biological samples to detect 
abnormalities involving mahogany gene structure, including 

15 

point mutations, insertions, deletions and chromosomal 
rearrangements. Such assays may include, but are not limited 
to, Southern analyses, single stranded conformation 
polymorphism analyses (SSCP) , and PCR analyses. 

Diagnostic methods for the detection of mahogany gene- 

20 specific mutations can involve for example, contacting and 
incubating nucleic acids obtained from a sample, e.-g. , 
derived from a patient sample or other appropriate cellular 
source with one or more labeled nucleic acid reagents 
including recombinant DNA molecules, cloned genes or 

25 degenerate variants thereof, such as described in Section 
5.1, above, under conditions favorable for the specific 
annealing of these reagents to their complementary sequences 
within or flanking the mahogany gene . Preferably, the 
lengths of these nucleic acid reagents are at least 15 to 30 
nucleotides. 

30 

After incubation, all non-annealed -nucleic acids are 
removed f rom the nucleic acid -.mahogany molecule hybrid. The 
presence of nucleic acids that have fcybaridized, if any such 
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molecules exist, is then detected. Using such a detection 
scheme, the nucleic acid from the cell type or tissue of 
interest can be immobilized, for example, to a solid support 
such as a membrane, or a plastic surface such as that on a 
5 microtiter plate or polystyrene beads. m this case, after 
incubation, non-annealed, labeled nucleic acid. reagents of 
the type described in Section 5.1 are easily removed. 
Detection of the remaining, annealed, labeled mahogany 
nucleic acid reagents is accomplished using standard 
iq techniques well-known to those in the art. The mahogany gene 
sequences to which the nucleic acid reagents have annealed 
can be compared to the" annealing pattern expected from a 
normal mahogany gene sequence in order to determine whether a 
mahogany gene mutation is present. 

In a preferred embodiment, mahogany gene mutations or 
15 polymorphisms can be detected by using a microassay of 

mahogany nucleic acid sequences immobilized to a substrate or 
"gene chip- (see, e.g. Cronin, et al . , 1996, Human Mutation 
7:244-255). 

Alternative diagnostic methods for the detection of 
20 mahogany gene specific nucleic acid molecules, in patient 
samples or other appropriate cell sources, may involve their 
amplification, e.g., by PCR (the experimental embodiment set 
forth in Mullis, 1987, U.S. Patent No. 4,683,202), followed 
by the analysis of the amplified molecules using techniques 
25 well known to those of skill in the art, such as, for 
example, those listed above. The resulting amplified 
sequences can be compared to those that would be expected if 
the nucleic acid being amplified contained only normal copies 
of the mahogany gene in order to determine whether a mahogany 
gene mutation exists. 

30 Among those mahogany nucleic acid sequences which are 

preferred for such amplification-related diagnostic screening 
analyses are oligonucleotide primers which amplify mahogany 
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exon sequences. The sequences of such oligonucleotide 
primers are, therefore, preferably derived from mahogany 
intron sequences so that the entire exon, or coding region, 
can be analyzed as discussed below. Primer pairs useful for 
5 amplification of mahogany exons are preferably derived from 
adjacent introns. Appropriate primer pairs can *be chosen 
such that each of the 2 5 mahogany exons are amplified. 
Primers for the amplification of mahogany .exons -can be 
routinely designed by one -of ordinary skill in the art b>y 
utilizing the exon and intron sequences of mahogany shown in 

10 Figures, particularly FIGS. 3 and 5. 

Additional mahogany nucleic acid sequences which are 
preferred for such amplification-related analyses are those 
which will detect the presence of a mahogany polymorphism 
which dof f ers from the consensus mahogany sequence depicted 

15 in Figures, particularly those that detect *he polymorphism 
identified in exon 15 {Figure 7). Such polymorphisms include 
ones which represent mutations associated with body weight 
disorders such as obesity, cachexia, or anorexia. 

Further, well-known genotyping techniques can be 

2 0 performed to type polymorphisms that are in close proximity 
to mutations in the mahogany gene itself, including mutations 
associated with weight disorders such as obesity, cachexia, 
or anorexia. Such polymorphisms can be used to identify 
individuals in families likely to carry mutations in the 
mahogany gene. If a, polymorphism exhibits linkage 

2 5 

disequilibrium with mutations in the mahogany gene , tne 
polymorphism can also be used to identify individuals in the 
general population who are likely to carry such mutations. 
Polymorphisms that can be used in this way include 
restriction fragment length polymorph i-sms (RFLPs) , which 

3 0 involve sequence variations in restriction enzyme target 

sequences, single -base polymorphisms, and simple sequence 
length polymorphisms (SSLPs) . 
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For example, Weber (U.S. Pat. No 5 075 pi-n * 
DNA marker based on length polvmo u 5 '° 75 ' 217) desc ^bes a 
^ 7J length polymorphisms in blocks of (dC- 

dA)n-(dG-dT)„ short tandem repeats.. The average separation 
of (dC-dA)n-(dG-dT)n blocks is estimate i- * ? 
hn M , S estimated to be 30,000-60,000 

S bP. Marker, that are so closely spaced exhlbit . M „ 

f regency co-inheritance, and are extremely usefu! in the 
identification of genetic mutations, such as. for example 
mutates within the m aho 9 a„ y 9ane , and the diagnos ^ 

io 3 diS ° rderS rel " ed - the maho.an. 

" dn* A1 f°;. CaSkey " ( °- S - *.3 6< .7S 9 ) describe a 

DNA profiling assay for detecting short tri and tetra 

the DNA ^of T" S6qUenCeS - ™< includes extracting 

the t T""'' SUCh " the maho9any «"». -Plifylna 
" a 8 DNA, and labelling the repeat sequences to form 

a genotypic map of the individual 's DNA ^ 

id. X^T 9 ^ Pr ° be C ° Uld Sddlti °-"y ^ used to directiy 
tZ IS T' FUrther - 3 probe or primers derived 

,0 Th D NA t ainedin these 

The level of mahogany gene expression can also be 
a-ayed. Por example. KNA from a cell type or 

bram. kidney, testes, heart, liver, lung, skin 
hypothalamus, spleen, and adipose tissue may be ' isolated and 
tested utilizing hybridization or P CR techniques such as are 
described, above. The isolated cells c „n v, ^ • 
„_-i-i ,^ ceils can be derived from 

cell culture or from a patient Th. , • 
3« frn .„„ patient. The analysis of cells taken 

from culture may be a necessary ,t« ( „ 

cell* i-r, h. step in the assessment of 

ceiis to be used as part of a ~u x. 

t . . . a cell-based gene therapy 

technique or, alternatively, to test 

y, to test the effect of compounds 
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on the expression of the mahogany gene. Such analyses may 
reveal both quantitative and qualitative aspects of the 
expression pattern of the mahogany gene, including activation 
or inactivation of mahogany gene expression. 
5 In one embodiment of such a detection scheme, a cDNA 

molecule is synthesized from an RNA molecule of interest 
(e.g., by reverse transcription of the RNA molecule into 
cDNA) . All or part of the resulting cDNA is then used as -the 
template for a nucleic acid amplification reaction, such as a 
PCR amplification reaction, or the like. The nucleic acid 
reagents used as synthesis initiation reagents {e.g. , 
primers) in the reverse transcription and nucleic acid 
..amplification steps of this method are chosen from among the 
mahogany gene nucleic acid reagents described in Section 5.1. 
The preferred lengths of such nucleic acid reagents are at 
15 least 9-30 nucleotides. 

For detection of the amplified product, the nucleic acid 
amplification may be performed using radioact ively or non- 
radioactively labeled nucleotides. Alternatively, enough 
amplified product may be made such that the product may be 
20 visualized by standard ethidium bromide staining or by 

utilizing any other suitable nucleic acid staining method. 

As an alternative to amplification techniques, standard 
Northern analyses can be performed to determine the level of 
mRNA expression of the mahogany gene, if a sufficient 
2s quantity of the appropriate cells can be obtained. 

Additionally, it is possible to perform such mahogany 
gene expression assays "in situ", i.e., directly upon tissue 
sections (fixed and/or frozen) of patient tissue obtaijied 
from biopsies or resections, such that no nucleic acid 
purification is necessary. Nucleic acid reagents such as 
30 those described in Section 5.1 may be used as probes and/or 
primers for such in situ procedures (-see, for -example, Nuovo, 
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CLJ 199a . ..p CR ln Sicu „ ybridi2ation; p ^ tocols 
Applications", Raven Press, WY) . 



5.4.1.2. DETEpTJQ^ o F M.HQgMT ftpwp „„»„ TrrT _ 
Mahogany gene products, including both wild-type and 
mutant mahogany gene products, conserved variants, and 
polypeptide fragments thereof, which are discussed _ ^ . 
10 Section 5.2. m ay be detected using antibodies which are ' 
directed against such mahogany gene products. Such 
antibodies, which are discussed in Section 5.3, below, may 
thereby be used as diagnostics and prognostics for a body 
weight disorder. Such methods may be used to detect 

1S L h n lT litieS ^ leVe " ° f mah ° 9any ^expression or of 

mahogany gene product synthesis, or abnormalities in the 

^ucture, temporal expression, and/or physical location of 

mahogany gene product . The antibodies and immunoassay 

methods described herein -have, for example, important in 

20 for body weight disorders such as obesity, cachexia, and 
anorexia. Antibodies, or fragments of antibodies, such as 
those described below, may be used to screen potentially 
therapeutic compounds i n vitro to determine their effects on 
mahogany gene expression and mahogany gene product 
25 production. The compounds that have beneficial effect 
body weight disorders, such as obesity, cachexia, and 
anorexia, can thereby he id«nMfi.j ^ 

.„„,.. , y identified, and a therapeutically 

effective dose determined,. 

in vitro immunoassays may also be used, for example to 

- CliZ d e T icacy of cell - based — r„ a L; 

^tfbodi d S ' inClUdin9 ° beSity - CaCh ^ ia - — -orexia: 
Anti bodles dlreoted agaiMt atbogny gene p ^ ucts 

used in vitro to determine, for example, the level of 
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mahogany gene expression achieved in cells genetically 
engineered to produce mahogany gene product. In -the case of 
intracellular mahogany gene products, such an assessment is 
done, preferably, using cell lysates or extracts. Such 
5 analysis will allow for a determination of the number of 
transformed cells necessary to achieve therapeutic efficacy 
in vivo, as well as optimization of the gene replacement 
protocol . 

The tissue or cell type to be analyzed will -generally 
^ include those that are known, or suspected, to express the 

mahogany gene. The protein isolation methods employed herein 
may, for example, be such as those described in Harlow and 
Lane (1988, "Antibodies: A Laboratory Manual " , Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York) . The 
isolated cells can be derived from cell culture or from a 
15 patient. The analysis of cells taken from culture may be a 
necessary step in the assessment of cells to be used as part 
of a cell-based gene therapy technique or, alternatively, -to 
test the effect of compounds on the expression of the 
mahogany gene . 

2o Preferred diagnostic methods for the detection of 

mahogany gene products, conserved variants or peptide 
fragments thereof, may involve, for example, immunoassays 
wherein the mahogany gene products or conserved variants or 
peptide fragments are detected by their interaction with an 
anti -mahogany gene product-specific antii>ody. 

2 5 

For example, antibodies, or fragments of antibodies, 
such as those described, above, in Section 5.3, may be used 
to quantitatively or qualitatively detect the presence of 
mahogany gene products or conserved variants or peptide 
fragments thereof. This can be accomplished, for example, t>y 
30 immunofluorescence techniques employing a fluorescent ly 

labeled antibody (see below, this Section) coupled wi^th light 
microscopic, flow cytometric, or f luorimetaric detection. 

- 4-6 - 



BNSDOCID <WO. 0005373A3 IA> 



WO 00/05373 



PCT/US99/16484 



such techniques are especially preferred for mahogany gene 
products that are expressed on the cell surface 

The antibodies (or fragments thereof) useful in the 
present invention may. additionally, be employed 
5 histologically, as in immunofluorescence or immunoelectron 
microscopy, for in . itu detection of mahogany gene products, 
conserved variants or pepti defragments thereof. In , itu 
detection may be accomplished by removing a histological 
specimen from a patient, and applying thereto a labeled 
.» that blndS t0 " "^^ polypeptide. T he antibody 

."i^TTV" ^^"^^ aPSlied *y overlaying the labeled 
antibody (or fragment, onto a biological sample: Through the 
use of such a procedure, it is possible to determine not only 
the presence of the mahogany gene product, conserved variant! 
or peptide fragments, but also its distribution *„ the 
examined tissue. Using the present invention, those of 
ordinary sxill Will readily recognize that any of a wide 
variety of histological methods (such as staining procedures) 
can be modified in order to achieve in situ detection of a 
mahogany gene product. 

20 immunoassays for mahogany gene products, conserved 

variants, or peptide fragments thereof will typically 
comprise, <D incubating a sample, such as a biological 

oftV eXtraCt ' freSMy harvested cells, or iysates 

of cells in the presence of a detectably labeled antibody 
25 capable of identifying mahogany gene products/conserved 

variants or peptide fragments thereof, and (2, detecting the 
the art" ^ ^ °' * ° £ well -Known in 

- imm ^ ° l09lCal SamPle ""^ bS br ° U9ht " -"'act with and 

30 *T^ * :eat ° 3 SO " d ^ ha - ^PP°« or carrier, such as 
nitrocellulose,, that is capable of immobilizing ceils, cell 
particles or soluble proteins, the support may then be 
washed with suitable buffers followed by treatmeht with the 
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detectably labeled mahogany gene product specific antibody. 
The solid phase support may then be washed with the .buffer a 
second time to remove unbound antibody. The amount of bound 
label on the solid support may then be detected by 
5 conventional means. 

By "solid phase support or carrier" is intended any 
support capable of binding an antigen or an antibody. Well- 
known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, 
iQ natural and modified celluloses, -polyacryl amides , gabbros, 
and magnetite. The nature of the carrier can be either 
soluble to some extent or insoluble for the purposes of the 
present invention. The support material may have virtually 
any possible structural configuration so long as the coupled 
molecule is capable of binding to an antigen or anti-body. 
15 Thus, the support configuration may be spherical, as in a 
bead, or cylindrical, as in the inside surface of a test 
tube, or the external surface of a rod. Alternatively, the 
surface may be flat such as a sheet, test strip, etc. 
Preferred supports include polystyrene beads. Those skilled 
2 0 in the art will know many other suitable carriers for binding 
antibody or antigen, or will be able to ascertain the same by 
use of routine experimentation. 

One of the ways in which the mahogany gene product - 
specific antibody can be detectably labeled is by linking the 
same to an enzyme, such as for use in an enzyme immunoassay 
(EI A) (Voller, A., "The Enzyme Linked Immunosorbent Assay 
(EL1SA) " , 1978, Diagnostic Horizons 2, 1-7, Microbiological 
Associates Quarterly Publication, Walkersville, MD) ; Voller, 
A. et aJ., 1978, J. Clin. Pathol. 31, S-07-520; Sutler, J.-E. , 
1981, Meth. Enzymol. 73, 482-523; Maggio, E. <ed.) , 1980, 
30 Enzyme Immunoassay, CRC Press, Boca Raton, FL, ; Ishikawa, E. 
et al., (eds.), 1981, Enzyme Immunoassay, Kgaku *Shqin, 
Tokyo) . The enzyme which is fcound to the antibody will react 
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«"th an appropriate substrate, preferably a endogenic 

t S hat Strate ; ^ * UCh 8 manner " t0 « -iety 

that can be detected, for exa mple . by spectrophctometric, 
"™-ic or by visual means. En4y mes that can be used t o 
5 detectably label the antibody include. - but are not limited 
to. malate dehydrogenase, staphylococcal nuclease, delta-5- 
steroid isomerase. yeast alcohol dehydrogenase, a- 
glycerophosphate, dehydrogenase, triose- phosphate isomerase, 
horserad.sh peroxidase, alkaline phosphatase, asparaginase 
10 9lUC ° Se ° Xidase ' 3-9alactosidase, ribonUclease. urease 

ItlTl' , 9lUC ° Se - 6 -P h °^e dehydrogenase, glucoa myl ase and 
acetylcholinesterase. The detectiQn ^ be ac . b . 

colori.etrxc methods that employ a chromogenic substrate -for 
the enzy.e. Detection may also be accomplished by visual 

comparison of the extent of enzymatic reaction ~f 
15 r'»atic reaction of a substrate 

in comparison with similarly prepared standards 

Detection may also be accomplished using any of a 

L a bir y °'° th,r —example, by radioactively 

label lng the antibodies or antibody fragments, it is possible^ 
to detect mahogany gene products through the use of a 
20 radioimmunoassay , RIA) < 5ee , for example, „eintraub. B 

Principles of Radioimmunoassays, Seventh Training Course on 
Le « Th T TeChni9UeS - The E " d °-^e society, March. . 

as the u r r iSOt ° Pe * such means 

as the use of a gamma counter or a scintillation Counter or 
25 ^autoradiography. 

It is also possible to label the antibody with a J 
fluorescent compound. When the f luorescently labeled 
antibody is exposed to light of the proper wave length, its 
presence can then be detected due to fiuprescence. Among the 

ly used fluorescent labeling compounds are 
fluorescein isothiocyanate. rhodamine. phycoerythrin. ■ 
phycocyanin. allophycocyanin. o-phthaldehyde and 
f luorescamine . 
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The antibody can also be -detectably labeled using 
fluorescence emitting metals such as 152 Eu, or others of the 
lanthanide series. These metals can be attached to the 
antibody using such metal chelating groups as 
5 diethylenetriaminepentacetic acid (OTPA) or 
ethylenediaminetetraacetic acid <EDTA) 

The antibody also can be detect ably labeled by coupling 
it to a chemiluminescent compound. The presence of the 
chemiluminescent -tagged antibody is then determined by 
^ detecting the presence of luminescence that arises during the 
course of a chemical reaction. Examples of particularly 
useful chemiluminescent labeling compounds are luminol, 
isoliiminol, theromatic acridinium ester, imidazole, 
acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label 
15 the antibody of the present invention. Bioluminescence is a 
type of chemiluminescence found in biological systems in 
which a catalytic protein increases the efficiency of the 
chemiluminescent reaction. The presence of a bioluminescent 
protein is determined by detecting the presence of 
2 0 luminescence. Important bioluminescent compounds for 

purposes of labeling are luciferin, luciferase and aaquorin. 

5.4.2. SCREENING ASSAYS FOR COMPOUNDS THAT 

INTERACT WITH THE MAHOGANY GENE OR GENE 
PRODUCT \ _^ 

25 The following assays are designed to identify compounds 

that bind to a mahogany gene product, compounds that b>ind to 
proteins, or portions of proteins that interact with a 
mahogany gene product, compounds that interfere with the 
interaction of a mahogany gene product with proteins and 

30 compounds that modulate the activity of the mahogany gene 

(i.e., modulate the level of mahogany gene expression and/or 
modulate the level of mahogany gene product activity) . 
Assays may additionally ±>e utilized that identify compounds 

- SO - 
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that bind to mahogany gene regulatory sequences (e g 

promoter sequences; see e . g ., Platt , 1994 3 

«. . a . W ; ass 62) , which is incorporated herein "y'Jference 

5 en s recy - and that can m ° du ^ - 

S 9ene expression. Such compounds may inciude. but are not 
i-ted to . mll organic molecuie ^ such a e no . 

able t the blood-brain barrier, gain to and/or entry 

into an appropriate ceU and affect expression of the 
mahogany gene or some other aene ^ • 

10 regdatory pathway, or t ^JSL^Si^ ^ ^ 

^.J^Ll" i<3en " f iCati °" ° f "* are 

aescribed, -below, m Section 5.4 2 2 Snr-h ,v . • 

Further among these compounds are compounds that af L 'the 
Xevei of mahogany gene expression and/or mahogany gene 
product activity and that can be used in the : therapeutic 
treatment of body weight disorders, including obes^y" 
-chexra. and anorexia, as described. below. i„ section 5 , 
Compounds may include K„t- * m 

20 :::::: r^zrjrr^- — ^ - 

* \fc>ee, e.g. , Lam, et al i qqi m-,^ 
o-> o/t tt ^ • ' iyyi, Nature 354 

82-84; Houghten, et aJ i qqi kt ^ 

/ cl c*^., 1^91 # Nature 354 84-ft<o 

combi atorial chemistry-derived of D- 

and/or L - configuration amino acids, phosphopeptides 
25 (including, but not iimited to members of random or P artia lly 
degenerate., directed phosphatide libraries; see . a 
ongyang. et 19 „_ -„ • 

huln 7' ^ limlted to " PolXdona!. ^nodonal. 

a^ooT ■■^- ldl ° tWlC ' - chain 

.0 r. tlb0d ^:"? a FAb ' and FAb expression library 

fragments, and epitope -binding fragments theree-r, and small 
organic or inorganic molecules. small 
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Compounds identified via assays such as those described 
herein may be useful, for example, in elaborating the 
biological function of the mahogany gene product and for 
ameliorating body weight disorders, such as obesity, 
5 cachexia, or anorexia. Assays for testing the effectiveness 
of compounds, identified by, for example, techniques such as 
those described in Sections 5.4.2.1-5.4.2.3, are discussed, 
below, in Section 5.4.2.4. 

5,4.2.1. IN VITRO SCREENING ASSAYS FOR 
10 COMPOUNDS THAT BIND TO THE MAHOGANY 

GENE PRODUCT 

In vitro systems may be designed to identify compounds 
capable of binding the mahogany gene products of the 
invention. Compounds identified may be useful/ for example, 

15 in modulating the activity of unimpaired and/or mutant 
mahogany gene products, may be useful in elaborating the 
biological function of the mahogany gene product, may be 
utilized in screens for identifying compounds that disrupt 
normal mahogany gene product interactions, or may in 

20 themselves disrupt such interactions. 

The principle of the assays used to identify compounds 
that bind to the mahogany gene product involves preparing a 
reaction mixture of the mahogany gene product and the test 
compound under conditions and for a time sufficient to allow 

25 the two components to interact and bind, thus forming a 

complex that can be removed and/or detected in the reaction 
mixture. These assays can be conducted in a variety of ways. 
For example, one method to conduct such an assay involves 
anchoring a mahogany gene product or a test substance onto a 
solid support and detecting mahogany gene product /test 

30 

compound complexes formed on the solid support at the .end of 
the reaction. In one -embodiment of such a method, the 
mahogany gene product may be anchored onto a *solid support, 
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and the test compound, which is not anchored, may be labeled 
either directly or indirectly. 

In practice, microtiter plates are conveniently 
utilized as the solid support . The anchored component may be 
5 immobilized by hon-covalent or covalent attachments. Non- 
covalent attachment- may be accomplished by simply coating the 
solid surface with a solution of the protein and drying 
Alternatively, an immobilized antibody, preferably a 
monoclonal antibody, specific for the protein to be 
^immobilized may be used to anchor the protein to the solid 
surface. The surfaces may be prepared in advance and stored 

In order to conduct the assay, the non- immobilized 
component is added to the coated surface containing the 
anchored component. After the reaction is complete, 
unreacted components are removed (e.g., by washing) 'under 
conditions such that any complexes formed will remain 
immobilized on the solid surface. The detection of complexes 
anchored on the solid surface can be accomplished in a number 
of ways. Where the previously non-immobili zed component is 
pre-labeled, the detection of label immobilized on the 
surface indicates that complexes were formed. Where the 
previously non- immobilized component is not pre-labeled an 
indirect label can be used to detect complexes anchored 'on 
the surface; e.g. , using a labeled antibody specific for the 
previously non- immobilized component (the antibody, in turn 
may be directly labeled or indirectly labeled with a labeled 
anti-Ig antibody) . 

Alternatively, a reaction can be conducted in a liquid 
Phase, the reaction products separated from unreacted 
components, and complexes detected; e.g. , using an 
immobilized antibody specific for mahogany gene product or 
the test compound to anchor any complexes formed in solution 
and a labeled antibody specific for the other component of 
the possible complex to detect anchored complexes. 
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5.4.2.2. ASSAYS FOR PROTEINS THAT INTERACT 
WITH TH E MAHOGANY GENE PRODUCT 

Any method suitable for detecting protein-protein 
interactions may be employed for identifying mahogany gene 
product-protein interactions. 

Among the traditional methods that may -be employed are 
co-immunoprecipitation, cross- linking and co-purification 
through gradients or chromatographic columns. Utilizing 
procedures such as these allows for the identification of 
proteins that interact with mahogany gene products. Such 
10 proteins can include, but are not limited, the mahoganoid 
gene product . 

Once isolated, such a -protein can be identified and can 
be used in conjunction with standard techniques, to identify 
proteins it interacts with. For example, at least a portion 
15 of the amino acid sequence of a -protein that interacts with 
the mahogany gene product can be ascertained using techniques 
well known to those of skill in the art, such as via the 
Edman degradation technique (see, e.g., Creighton, 1983, 
"Proteins: Structures and Molecular Principles, " W.H. 
2Q Freeman & Co., N.Y., pp. 34-49). The amino acid sequence 
obtained may be used as a guide for the generation of 
oligonucleotide mixtures that can be used to screen for gene 
sequences encoding such proteins . Screening may be 
accomplished, for example, by standard hybridization or PCR 
techniques. Techniques for Uie generation of oligonucleotide 
25 mixtures and the screening are well-known. -(See, e.g., 

Ausubel, supra, and 1990, "PCR Protocols: A Guide to Methods 
and Applications, » Innis, et al . , eds . Academic Pr^ess, Inc.. 
New York) . 

Additionally, methods may be employed that result in the 
30 simultaneous identification of genes that encode a protein 
which interacts with a mahogany gene product. These methods 
include, for example, probing .expression libraries with 
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labeled mahogany gene product, using mahogany gene product in 
a manner similar to the well known technique of antibody 
probing of Agtll libraries. 

One method that detects protein interactions in vivo 
5 the two-hybrid system, is described in detail for 

illustration only and not by way of limitation. One version 
of this system has been described (Chien, et al . , 1991 , Proc 

Natl. Acad. Sci . USA, 88, 9578-958?} i ~ 

«, *=>,e sbB^j and is commercially 

available from Clontech (Palo Alto, CA). 
1Q Briefly, utilizing such a system, plasmids are 

constructed that encode two hybrid proteins: one consists of 
the DNA-binding domain of a transcription activator protein 
fused to the mahogany gene product and the other consists of 
the transcription activator protein- s activation domain fused 
to an^unknown protein that is encoded by a cDNA that has been 
recombined into this plasmid as part of a cDNA library The 
DNA-binding domain fusion plasmid and the cDNA library are 
transformed into a strain of the yeast Saccharoses 
cerevisiae that contains a reporter gene (e.*., HBS or lacZ, 
whose regulatory region contains the transcription 

20 aCtiVa '° r,S blndin9 Site - ^ther hybrid protein alone cannot 
activate transcription of the reporter gene: the DNA-binding 
domaan hybrid cannot because it does not provide activation 
function and the activation domain hybrid cannot because it 
cannot, localize to the activator's binding sites 

25 interaction of the two hybrid proteins reconstitutes the 

functional activator protein and results in expression of the 
reporter gene, which is detected by an assay for the reporter 
gene product. 

The two-hybrid system or related methodologies may be 

30 tP SCreen aCtiVat1 ^ d ° main libraries for proteins that 

interact wath the .'bait- gene product . By way of example, 
and not by way of limitation, mahogany gene products may be 
used as the bait gene product. Total genomic or cDNA 
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sequences are fused to the DNA encoding an activation domain. 
This library and a plasmid encoding a hybrid of a bait 
mahogany gene product fused to the DNA-binding domain are co- 
transformed into a yeast reporter strain, and the resulting 
5 transf ormants are screened for those that express the 

reporter gene. For example, a bait mahogany gene sequence, 
such as the open reading frame of the mahogany gene, can be 
cloned into a vector such that it is translat ionally fused to 
the DNA encoding the DNA-binding domain of the GAL4 protein. 
These colonies are purified and the library plasmids 

10 

responsible for reporter gene expression are isolated. DNA 
sequencing is then used to identify the proteins encoded by 
the library plasmids. 

A cDNA library of the cell line from which proteins that 
interact with bait mahogany gene product are to be detected 
15 can be made using methods routinely practiced in the art. 
According to the particular system described herein, for 
example, the cDNA fragments can be inserted into a vector 
such that they are translationally fused to the 
transcriptional activation domain of -GAL4 . Such a library 
20 can be co-transformed along with the bait mahogany gene-GAL.4 
fusion plasmid into a yeast strain that -contains a lacZ gene 
driven by a promoter that contains GAL4 activation sequence. 
A cDNA encoded protein, fused to a -GAL4 transcriptional 
activation domain that interacts with bait mahogany gene 
^ product will reconstitute an active <3AL4 protein and thereby 
drive expression of the HIS3 gene. Colonies that express 
HIS3 can be detected by their growth on petri dishes 
containing semi -sol id agar based media lacking histidine. 
The cDNA can then be purified from these strains, and used to 
produce and isolate the bait mahogany gene product- 
30 interacting protein using techniques routinely practiced in 
the art. 
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5.4.2.3. ASSAYS FOR COMPOUNDS THAT INTERFERE 
WITH MAHOGANY GENE PRODUCT 



MACROM OLECULE T NTERAOTTOM 



The mahogany gene products may, in vivo, interact with 
5 one or more macromolecules, such as proteins. For example 
the mahogany gene products may, in vivo, interact with the' 
mahoganoid gene products. Other macromolecules which 
interact, with the mahogany gene products may include, but are 
not limited to, nucleic acid molecules and those proteins 
1£) adentafied via methods such, as those described, above, in 
Sections 5.4.2.1 - 5 4 o r> -c— 

u , 5.4.2.2. For purposes of this discussion 

the macromolecules are referred to herein as "binding 
partners". Compounds that disrupt mahogany gene product 
banding to a binding partner may be useful in regulating the 
i5 activity of the mahogany ge „ e product, especially mutant 

mahogany gene products. Such compounds may include, but are 
not limited to molecules such as peptides, and the like as 
described, for example, in Section 5.4.2.1 above. 

The basic principle of an assay system used to identify 
compounds that interfere with the interaction between the 
2 0 mahogany gene product and a binding partner or partners 

involves preparing a reaction mixture containing the mahogany 
gene product and the binding partner under conditions and for 
a time sufficient to allow the two to interact and bind, thus 
forming a complex. In order to test a compound for 
25 inhibitory activity, the reaction mixture is prepared in the 
presence and absence of the test compound. The test compound 

/ initially ±nClUded in the -action mixture, or may be 
added at a time subsequent to the addition of mahogany gene 
product and its binding partner. Control reaction mixtures 
are incubated without the test compound or with a compound 
whach is known not to block complex formation. The formation 
of any complexes between the mahogany gene product and the 
binding partner is then detected. The formation of a complex 



30 
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in the control reaction, but not in the reaction mixture 
containing the test compound, indicates -that the compound 
interferes with the interaction of the mahogany -gene product 
and the binding partner. Additionally, -complex formation 
5 within reaction mixtures containing the test compound and 
normal mahogany gene product may also be compared to complex 
formation within reaction mixtures containing the test 
compound and a mutant mahogany gene product. This comparison 
may be important in those cases wherein it is desirable to 
identify compounds that disrupt interactions of mutant but 

10 

not normal mahogany gene product . 

The assay for compounds that interfere with the 
interaction of the mahogany gene products and binding 
partners can be -conducted in a heterogeneous or homogeneous 
format. Heterogeneous assays involve anchoring either the 

15 mahogany gene product or the binding partner onto a solid 
support and detecting complexes formed on the solid support 
at the end of the reaction. In homogeneous assays, the 
entire reaction is carried out in a liquid phase. In either 
approach, the order of addition of react ants can be varied to 

20 obtain different information about the compounds -being 

tested. For example, test compounds that interfere with the 
interaction between the mahogany gene products and t-he 
binding partners, e.g., by competition, can be identified by 
conducting the reaction in the presence of the test 
substance; i.e., by adding the test substance to the -reaction 

25 

mixture prior to or simultaneously with the mahogany gene 
product and interactive intracellular binding partner. 
Alternatively, test compounds that disrupt preformed 
complexes, e.g., compounds with higher binding constants that 
displace one of the components from the complex, can 4>e 
30 tested by adding the test compound to the reaction mixture 
after complexes have been formed. The various formats are 
described briefly below* 
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In a heterogeneous assay system, either the mahogany 
gene product or the interactive binding partner, is anchored 
onto a solid surface, while the non-anchored species is 
labeled, either directly or indirectly. In practice 
5 nucrotiter plates are conveniently utilized. The anchored 
species may be immobilized by non-covalent or covalent 
attachments. Non-covalent attachment .may be accomplished 
simply by coating the solid surface with a solution of the 
mahogany gene product or binding partner and drying 
io .Alternatively. an immobilized antibody specific for the 

species to be anchored may be used to anchor the species to 
the solid surface. The surfaces may be prepared in advance 
and. stored . 

In order to conduct the assay, the partner of the 
immobilized species is exposed to the: coated surface with or 
without the test compound. After the reaction is complete 
unreacted components are removed <e. 3 ., by washing) and any 
complexes formed will remain immobilized on the solid 
surface. The detection of complexes anchored on the solid 
surface can be accomplished in a number o£ ways Wner£ che 
20 non-immobilized species is pre-labeled. the detection of 
label immobilized on the surface indicates that complexes 
were formed, "here the non-immobiiized species is not pre- 
labeled. an indirect label can be ussd to detect complexes 
anchored on the surf aos . _ „ , 

surface, e.gr., using a labeled antibody 

25 specific for the initially non- immobilized species (the 
anybody, in turn, may be directly labeled or indirectly 
labeled with a labeled anti- Ig antibody) . Depending upon the 
order of addition of reaction components, test compounds that 
inhabit complex formation or that disrupt preformed complexes 
can be detected. 

30 - Alternatively, the reaction can be conducted in a liquid 
Phase in the presence or absence of the test compound, the 
reaction products separated from unreacted components, and 
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complexes detected; e.g., using an immobilized antibody 
specific for one of the binding components to anchor any 
complexes formed in solution, and a labeled antibody specific 
for the other partner- to detect anchored complexes. Again, 
5 depending upon the order of addition of reactants to the 
liquid phase, test compounds that inhibit complex formation 
or that disrupt preformed complexes can be identified. 

In an alternate embodiment of the invention, a 
homogeneous assay can be used. In this approach, a preformed 
^ complex of the mahogany gene product and the interactive 

binding partner is prepared in which either the mahogany gene 
product or its binding partners is labeled, but the signal 
generated by the label is quenched due to complex formation 
(see, e.g., U.S. Patent No. 4,109,4 96 by Rubenstein which 
utilizes this approach for immunoassays) . The addition of a 
15 test substance that competes with and displaces one of the 
species from the preformed complex will result in the 
generation of a signal above background. In this way, test 
substances that disrupt mahogany gene product /binding partner 
interaction can be identified. 
2 0 In .another embodiment of the invention, these same 

techniques can be employed using peptide fragments that 
correspond to the binding domains of the mahogany gene 
product and/or the binding partner (in cases where the 
binding partner is a protein) , in place of one or tooth of the 
25 full length proteins. Any number of methods routinely 

practiced in the art can be used to identify and isolate the 
binding sites. These methods include, but are not limited 
to, mutagenesis of the gene encoding one of the proteins and 
screening for disruption of binding in a co- 
immunpprecipitation assay. Compensating mutations in the 
J0 gene , encoding the second species in the complex can then be 
selected. Sequence analysis of the genes encoding the 
respective proteins will reveal the mutations that correspond 
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to the region of the protein invoked in interactive binding 
Alternatively, one protein can be anchored to a solid surface 
using methods described in this Section above, and a llo „ed to 
interact with and bind to its labeled binding partner, which 
5 has been treated with a proteolytic enzyme, such as trypsin 
After washing, a short, labeled peptide comprising the 
binding domain may remain associated with the solid material 
which can be isolated and identified by amino acid 
sequencing. Also, once the gene coding for the segments is 
io engineered, to express peptide fragments of the protein it 
can then be tested for binding activity and purified or 
synthesized. 

For example, and not by way of imitation, a mahogany 
gene product can be anchored to a solid material as 
described, above, in this Section by making a CST-1 fusion 
protein and allowing it to bind to glutathione agarose beads. 
The binding partner can be labeled with a radioactive 
isotope, such as »s. and cleaved with a proteolytic en 2 yme 
such as trypsin. Cleavage products can then be added to the 
anchored GST- 1 fusion protein and allowed to bind. After 
20 "ashrng away unbound peptides, labeled bound material 
representing the binding partner binding domain, can be 
•luted, purified, and analyzed for amino acid sequence by 
well-known methods. Peptides so identified can be produced 
synthetically or produced using recombinant DNA technology 



25 



5.4.2.4 



30 



.4. ASSAYS FOR THE IDENTIFICATION OF 
COMPOUNDS THAT AMELIORATE BODY 

WEIGHT DISORngpq 

Compounds, including but not limited to binding 
compounds identified via assay techniques such as those 
descried, above, in Sections 5.4.2.1 - 5.4.2.3, can be 
tested for the ability to ameliorate body weight disorder 
symptoms, including obesity, cachexia, and anorexia 



It 
61 - 



BNSDOCID: <WO 0005373A3.IA> 



WO 00/1(5373 



.PCT/US99/H.484 



should be noted that the assays described herein can identify 
compounds that affect mahogany activity by either -affecting 
mahogany gene expression or by affecting the level of 
mahogany gene product activity. For example, compounds may 
5 be identified that are involved in another step in the 
pathway in which the mahogany gene and/or mahogany gene 
product is involved, such as, for example, a step which is 
either "upfield" or "downf ield" of the step in the pathway 
mediated by the mahogany gene. Such compounds may, -by 
Q affecting this same pathway, modulate the effect of mahogany 
on the development of body weight disorders. Such compounds 
can be used as part of a therapeutic method for the treatment 
of the disorder. 

Described below are cell^based and animal model -based 
assays for the identification^ compounds exhibiting such an 
5 ability to ameliorate body weight disorder symptoms . 

First, cell -based systems can be used to identify 
compounds that may act to ameliorate body weight -disorder 
symptoms. Such cell systems can include, for .example, 
recombinant or non- recombinant cell, such as cell lines, that 
3 express the mahogany gene. 

In utilizing such cell systems, cells that-express 
mahogany may be exposed to a compound suspected of exhibiting 
an ability to ameliorate body weight disorder symptoms, at a 
sufficient concentration and for a sufficient time to -elicit 
. such an amelioration of such symptoms in the exposed cells. 
After exposure, the cells can be assayed to measure 
alterations in the expression of the mahogany gene, e.g., by 
assaying cell lysates for mahogany mRNA transcripts (e.g., by 
Northern analysis) or for mahogany gene products expressed by 
the cell ; compounds that modulate expression .of the mahogany 
gene are good candidates as therapeutics. 

In addition, animal-based systems or -models for a 
mammalian body weight disorder, for example, transgenic mice 
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containing a human or altered f n ™ ~* , 

altered form of mahogany gene, may be 

used to identify compounds caoaW^ nf • 

K capable of ameliorating symptoms 

of the disorder. Such animal models may be used as test 

. substrates for the identification of druac ^ 
■• 4 " n or aru 9s, pharmaceuticals 

S therap.es and interventions . For example, animal models may 
be exposed to a compound suspected of exhibiting an ability 
to ameliorate symptoms. at . sufficient concentration and for 
a sufficient time to elicit such an amelioration of body 
weight disorder symptoms. The response of the animals to the 

exposure may be monitored by assessing 
10 y assessing the reversal of the 

symptoms of the disorder. 

With regard to intervention, any treatments that reverse 
any aspect of body weight disorder-lifce symptoms should be 
considered as candidates for human therapeutic intervention 
15 bv d 3 diS ° rd ~- D °"S- P* test agents may be determined 
by deriving dose-response curves, as discussed in section 
5.5.1, below. 

5.4.3. COMPOUNDS AND METHODS FOR THE TREATMENT 
OF BODY mam Mi,^, TREATMEN T 

20 k « DSSCribed below are methods and compositions whereby 
body weight disorders, including obesity, cachexia, and 
anorexia, may be treated. Such methods can comprise, for 
example administering compounds which modulate the expression 
of a mammalian mahogany gene and/or the synthesis or activity 
. of a mammalian mahogany gene product, so that symptoms of the 
body weight disorder are ameliorated. Alternatively/in 
those instances whereby the mammalian body weight disorder 
results from mahogany gene mutations, suc h methods can 
comprise supplying the mammal with a nucleic acid molecule 
encoding an unimpaired mahogany gene product such that an 
30 unimpaired mahogany gene product is expressed and symptoms of 
the disorder are ameliorated. 
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In another embodiment of methods for the treatment of 
mammalian body weight disorders resulting from mahogany gene 
mutations, such methods can comprise supplying the mammal 
. with a cell comprising a nucleic acid molecule that encodes 
5 an unimpaired mahogany gene product such that the cell 

expresses the unimpaired mahogany gene product, and symptoms 
of the disorder are ameliorated. 

Because a loss of normal mahogany gene function results 
in the restoration of a non-obese phenotype in individuals 
^ exhibiting an agouti mutation (e.g. individuals that 
ectopically express the agouti gene in all tissues) a 
decrease or elimination of normal mahogany gene product would 
facilitate progress towards a normal body weight state in 
such individuals. Methods for inhibiting or reducing the 
level of mahogany gene product synthesis or expression can 
15 include, for example, methods such as those described in 
Section 5.4.3.1. - 

Alternatively, symptoms of certain body weight disorders 
such as, for example, cachexia and anorexia, which involve a 
lower than normal; body weight phenotype, may be ameliorated 
20 b Y increasing the level of mahogany gene expression and/or 
mahogany gene product activity. Methods for enhancing the 
expression or synthesis of mahogany can include, for example, 
methods such as those described below, in Section 5.4.3.2 

5.4.3.1. INHIBITORY ANTI SENSE, RIBOZYME 
AND TRIPLE HELIX APPROACHES 

In another embodiment, symptoms of body weight disorders 

may be ameliorated by decreasing the level of mahogany gene 

expression and/or mahogany gene product activity by using 

mahogany gene sequences in conjunction with well-known 

30 antisense, gene "knock-out," ribozyme and/or triple helix 

methods to decrease the level of mahogany gene expression. 

Among the compounds that may exhibit the ability to modulate 
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the activity, expression or synthesis of the mahogany gene 
including the ability to ameliorate the symptoms of a / 
mammalian body weight disorder, are antisense, ribozyme. and 
triple helix molecules. Suc h molecules may be designed to 
5 reduce or inhibit either unimpaired, or if appropriate 

mutant target gene activity. Technics for the production 
and use of such molecules are well known to those of skill in 
the art . * 

Antisense M and DNA molecules act to directly block 
10 the of mRNA by hybridizing to targeted mRNA and 

preventing protein translation. Antisense approaches involve 
the design of oligonucleotides that are complementary to a 
target gene mRNA. The antisense oligonucleotides will bind- 
to the complementary target gene mRNA transcripts and prevent 
translation. Absolute complementary, although preferred. 
x => is not required. 

A sequence -complementary" to a portion of an RNA. as 
referred to herein, means a sequence having sufficient 

rziirTT to be abie to hytridize with ^ »». 

a stable duplex; in the case of double-stranded antisense 
20 nucleac acids, a single strand of the duplex DNA may thus be 
tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complementary 
and the length of the antisense nucleic acid. Generally, the 
longer the hybridizing nucleic acid, the more base mismatches 
2s with an rn* it „ y contaln and stin ^ g scafaie J 

triplex, as the case may be) . One skilled in the art can 
ascertain a tolerable degree of mismatch by use of standard ■"' 
procedures to determine the mating point of the hybridized 

complex. 2 

in one embodiment, oligonucleotides complementary to - 
non-codrng regions of the mahogany gene could be used in an 
antisense approach to inhibit translation of endogenous 
mahogany mSNA. Antisense nucleic acids should be at least V 
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six nucleotides in length, and are preferably 

oligonucleotides ranging -from 6 to about 50 nucleotides in 

length. In specific aspects the oligonucleotide is at least 

10 nucleotides, at least 17 nucleotides, at least 25 

5 nucleotides or at least 50 nucleotides. 

Regardless of the choice of target sequence, it is 

preferred that in vitro studies are first performed to 

quantitate the ability of the antisense ol igonucieotide to 

inhibit gene expression. It is preferred that: these -studies 

utilize controls that distinguish between antisense qene 
10 3 
inhibition and nonspecific biological effects of . 

oligonucleotides. It is also preferred that these studies 
compare levels of the target RNA or protein with that of an 
internal control RNA or protein. Additionally, it is 
envisioned that results obtained using the antisense 
15 oligonucleotide are compared with those obtained using a 
control oligonucleotide. It is preferred that the control 
oligonucleotide is of approximately the same length as the 
test oligonucleotide and that the nucleotide sequence of the 
oligonucleotide differs from the ant i sense sequence no more 
20 than is necessary to prevent specific hybridization to the 
target sequence. 

The oligonucleotides can be DNA or RNA or chimeric 
mixtures or derivatives or modified versions thereof, single- 
stranded or double- stranded. The oligonucleotide can be 
^ modified at the base moiety, sugar moiety, or phosphate 

backbone, for example, to improve stability of the molecule, 
hybridization, etc. The oligonucleotide may include other 
appended groups such as peptides (e.g., for targeting host 
cell receptors in vivo) , or -agents facilitating transport 
across the cell membrane (see, e.g., l^etsinger, et al . , 1989, 
30 Proc. Natl. Acad. Sci . U.S.A. 86, -6553 -€5S<6 ; JLemaitre, et 
al. , 1987, Proc. Natl. Acad. Sci. U.S.A. 84, -648-652; PCT 
Publication No. WO88/0981O, published December 15, 198£) or 
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the blood-brain barrier (see, e.g., PCT Publication No 
WO89/10134, published April 25, 1988), hybridization- 
triggered cleavage agents (see, e.g., Krol et al 1988 
BioTechniques 6, 958-976) or intercalating agents (see/e g 
5 2on, 1988, Pharm. Res. 5, 539-549) . To this end the 

oligonucleotide may be conjugated to another molecule e g 
a peptide, hybridization triggered cross-linking agent 
transport agent, hybridization- triggered cleavage agent etc 
The antisense oligonucleotide may comprise at least one 
1Q modafied base moiety which is selected from the group 

deluding but not limited to 5-f luorouracil , 5-bromouracil 
5-chlorouracil, 5 - iodouracil , hypoxanthine , xanthine, 
4-acetylcytosine, 5 - (carboxyhydroxyl meth yl ) uracil ., 
5 - carboxymethylami nomethyl - 2 - thiouridine , 
^ 5-carboxymethylaminomethyluracil, dihydrouracil , beta-D- 
galactosylqueosine, inosine, N6-isopentenyl adenine 

1- methylguanine, i-methylinosine. 2,2-dimethylguanine 

2- methyladenine, 2 -methylguanine, 3 -methyl cytosine, 
5-methylcytosine, N6-adenine, 7-methylguanine 

5-methylaminomethyluracil, 5-methoxyaminomethyl -2-thiouracil 
20 beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil 
5-methoxyuracil , 2-methylthio-N6-isopent e nyladehine 
uracil-5-oxyacetic acid (v) , wybutoxosine , pseudouracil 
queosine, 2 -thiocytosine, 5-methyl -2-thiouracil , 
2-thiouracil, 4 - thiouracil , 5-methyluracil , uracil- 
25 5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v) 
5-methyl-2-thiouracil, 3- <3-amino-3- N -2-carboxypropyl) 
uracil, (acp3)w, and 2 , 6 -diaminopurine . 

The antisense oligonucleotide may also comprise at least 
one modified sugar moiety selected from the group including 

30 ^ llmite4 t0 -"binose, 2,fluoroarabinose; xylulose, 

and hexose . 

in yet another embodiment/the antisense oligonucleotide 
comprises at least one modified phosphate backbone selected 
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from the group consisting of a phosphorothioate , a 
phosphorodithioate , a phosphoramidothioate , a 

phosphoramidate , a phosphordiamidate , a methylphosphonate , an 
alkyl phosphbtriester , and a formacetal or analog thereof. 

In yet another embodiment, the ant i sense oligonucleotide 
is an a-anomeric oligonucleotide. An a-anomeric 
oligonucleotide forms specific double- stranded hybrids with 
complementary RNA in which, contrary to the usual £-units, 
the strands run parallel to each other (Gautier, et al . , 
1987, Nucl . Acids Res. 15, 6625-6641). The oligonucleotide 
is a 2'-0-methylribonucleotide (Inoue, al . , 19&1 1 Nucl. 

Acids Res. 15, 6131-6148), or a chimeric RNA - DN A analogue 
(Inoue, et al.. , 1987, FEBS Lett. 215, 327-330). 

Oligonucleotides of the invention may be -synthesized by 
standard methods known in the art, e.g., by use of an 
automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.) . As examples, 
phosphorothioate oligonucleotides may be synthesized by the 
method of Stein/ et al . (1988, Nucl. Acids Res. 16, 32*09) , 
methylphosphonate oligonucleotides can be prepared by use of 
0 controlled pore glass polymer supports -{'Sarin, et aJ., 1S-8S , 
Proc. Natl. Acad. Sci . U.S.A. 85, 744S-7451) , etc. 

While antisense nucleotides complementary to the target 
gene coding region sequence could be used, those 
complementary to the transcribed,, untranslated region are 
most pref erred 

5 

Antisense molecules should -be delivered to cells that 
express the target gene in vivo. A number of methods -have 
been developed for delivering antisense DNA or RNA to cells; 
e.g., antisense molecules can be injected directly into the 
tissue site, or modified antisense molecules, designed to 
0 target the desired cells (e.g., anti-sense linked to peptides 
or antibodies that specifically bind receptors or antigens 
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expressed on the target cell surface) can be administered 
.systemically. • 

However, it is often difficult. to achieve intracellular 
concentrations of the antisense sufficient to suppress . 
5 translation of endogenous mRNAs . Therefore a preferred 
approach utilizes a recombinant DNA construct in -which the. 
antisense oligonucleotide is placed under the control of a 
. strong pol III or pol II promoter. The use of such a 
construct to transfect target cells in the patient will . 
^ result in the transcription of sufficient amounts of single 
stranded RNAs that will form complementary base pairs with 
the endogenous target gene transcripts and thereby prevent 
translation of the target gene mRNA. For example, a vector 
can be introduced e.g., such that it is taken up by a cell 
and directs the transcription of an antisense RNA. Such a 
15 vector can remain episomal or become chromosomal ly 

integrated, as long as it can be transcribed to produce the 
desired antisense RNA. Such vectors can be constructed by 
recombinant DNA technology methods standard in the art. 
Vectors can be plasmid, viral, or others known in the art, 
20 used for replication and expression in mammalian cells. 

Expression of the sequence encoding the antisense RNA can be 
by any promoter known in the art to act in mammalian, ■ 
preferably human cells. Such prompters can be inducible or 
constitutive. Such promoters include but are not limited to: 
2s the SV40 early promoter region (Bernoist and Chambon, 1981, 
Nature 290, 304-310), the promoter contained in the 3' long 
terminal repeat of Rous sarcoma virus (Yamamoto, et al . , 
1980, Cell 22, 787-797), the herpes thymidine kinase promoter 
(Wagner, et al . , 1981, Proc . Natl. Acad. Sci . U.S.A. 78, 
1441-1445), the regulatory sequences of the metal lothionein 
30 gene (Brinster, et al . , 1982, Nature 296, 39-42), «tc. Any 
type of plasmid, cosmid, YAC or viral vector can be used to 
prepare the recombinant DNA construct which can be introduced 
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directly into the tissue site. Alternatively, viral vectors 
can be used that selectively infect the desired tissue, in 
which case administration may be accomplished by another 
route (e.g., systemically) . 

Ribozyme molecules designed to catalytic-ally cleave 
target gene mRNA transcripts can also be used to prevent 
translation of target gene mRNA and, therefore, expression of 
target gene product. (See, e.g., PCT International 
Publication WO90/11364, published October 4, 1990; "Sarver, e-t 
al . , 1990, Science 247, 1222-1225). 

Ribozymes are enzymatic RNA molecules capable of 
catalyzing the specific cleavage of RNA. (For a review, see 
Rossi , 1994, Current . Biology 4, 469-471)-. The mechanism of 
ribozyme action involves sequence specific hybridization of 
the ribozyme molecule to complementary target RNA, followed 
by", an endonucleolytic cleavage event. The composition of 
ribozyme molecules must include one or more sequences 
complementary to the tar-get gene mRNA, and -must include the 
well known catalytic sequence responsible for mRNA cleavage. 
For this sequence, see, e.g., U.S. Patent No. 5,093,24:6, 
which is incorporated herein by reference in its entirety. 

While ribozymes that cleave mRNA at site specific 
recognition sequences can be used to destroy target gene 
mRNAs , the use of hammerhead ribozymes is preferred. 
Hammerhead ribozymes cleave mRNAs at locations -dictated by 
flanking regions that form complementary base pairs with the 
target mRNA. The sole requirement is that the target mRNA 
have the following sequence of two bases: S'-^JG-S 1 . The 
construction and production of hammerhead ribozymes is well 
known in the art and is described more fully in Myers, 1995, 
Molecular Biology and Biotechnology; A Comprehensive Desk 
Reference, VCH Publishers, New York, (see especially .^i^ure 
4, page 833) and in Haseloff and <3erlach, 19$$, Nature, 334, 
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585-591, which is incorporated herein by reference in its 

entirety . ' : 

Preferably the ribozyme is engineered so that the 
cleavage recognition site is located near the 5' end of the 
5 target gene mRNA, i.e., to increase efficiency and minimize 

the intracellular accumulation of non- functional mRNA 
transcripts. 

The ribozymes of the present invention also include RNA 
endoribonucleases (hereinafter "Cech-type ribozymes") such as 
the one that occurs naturally in Tetrahymena thermophila 
(known as the IVS, or L-19 IVS RNA) and that has been 
extensively described by Thomas Cech and collaborators (Zaug 
et al., 1984, Science, 224, 574-578; Zaug and Cech, 1986 
Science, 231, 470-475; Zaug, et al . , 1986, Nature, 324, 429- 
433; published International patent application No. WO 
88/04300 by University Patents Inc.; Been and Cech, 1986 
Cell, 47, 207-216). The Cech- type ribozymes have an eight 
base pair active site which hybridizes to a target RNA 
sequence whereafter cleavage of the target RNA takes place 
The invention encompasses those Cech-type ribozymes which 
20 target eight base-pair active site sequences that are present 
in the target gene. 

As in the antisense approach, the ribozymes can be 
composed of modified oligonucleotides (e.g. , for improved 
stability, targeting, etc.) and should be delivered to cells 
25 that express the target gene in vivo. A preferred method of 
delivery involves using a DNA construct "encoding" the 
ribozyme under the control of a strong constitutive pol III . 
or pol II promoter, so that transacted cells will produce 
sufficient quantities of the ribozyme to destroy endogenous 
target gene messages and inhibit translation. Because 
ribozymes unlike antisense molecules, are catalytic, a lower 
intracellular concentration is required for efficiency 
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Endogenous target gene expression can also be reduced by 
inactivating or "knocking out" the target gene or its 
promoter using targeted homologous recombination (e.g., see 
Smithies, et a J . , 1985, Nature 317, 230-234; Thomas and 
5 Capecchi, 1987, Cell 51, 503-512; Thompson, et al . , 1989, 
Cell 5, 313-321; each of which is incorporated by reference 
herein in its entirety). For example, a mutant, non- 
functional target gene (or a completely unrelated DNA 
sequence) flanked by DNA homologous to the endogenous target 

^ gene (either the coding regions or regulatory regions of the 
target gene) can be used, with or without a selectable marker 
and/or a negative selectable marker , to transfect cells that 
express the target gene in vivo. Insertion of the DNA 
construct, via targeted homologous recombination, results in 
inactivation of the target gene. Such approaches are 

15 particularly suited in the' agricultural field where 

modifications to ES (embryonic stem) cells can te used to 
generate animal offspring with an inactive target gene (e.g., 
see Thomas and Capecchi, 1987 and Thompson, 19>89, supra). 
However this approach can be adapted for use in humans 

20 provided the recombinant DNA constructs are directly 

administered or targeted to the required -site in vivo using 
appropriate viral vectors. 

Alternatively, endogenous target gene expression can be 
reduced by targeting deoxyribonucleot ide sequences 

25 complementary to the regulatory region of the target gene 
(i.e., the target gene promoter and/or enhancers) to form 
triple helical structures that prevent transcription of the 
target gene in target cells in the body. (See generally, 
Helene, 1991, Anticancer Drug Des . , 6 (-6), 5^9-584 ; Selene, et 
al., 1992, Ann. N.Y. Acad. Sci . , 660, 27 -3«; and Maher, 1992, 

3 0 

Bioassays 14 (12) , 807-815) . 

Nucleic acid molecules to be used in triplex ^helix 
formation for the inhibition of transcription should -te 
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single stranded and composed of deoxynucleotides . The base 
composition of these oligonucleotides must be designed to 
promote triple helix formation via Hoogsteen base pairing 
rules, which generally require sizeable stretches of either 
5 purges or pyrimidines to be present on one strand of a 
duplex. Nucleotide seouence* «,« v,= • . , 

q es ma y be pynmidine-based. which 
-.Will result in TAT and COC- triplets across the three 
associated strands of the resulting triple helix. The 
pyrimidine-rich molecules provide base complementary to a 
purine-rich region of a single strand . M the duplex in a 
parallel orientation to that strand. In addition, nucleic 
ac ld molecules may be chosen that' are purine-rioh, for 
example, contain a stretch of G residues. These molecules 
w.ll form a triple helix with a D N A duplex that is rich in GC 
pa.rs. in which the majority of the purine residues are 
ocated on a single strand of the targeted duplex, resulting 
" 1PletS aCr ° ss the three strands in the triplex 
Alternatively, the potential sequences that can be' 
targeted for triple helix formation may be increased by 
creating a so called "switchback-nucleic acid molecule 
20 switchback molecules are synthesized in an alternating 5.-3. 
3 -5. manner, such that they base pair with first one strand' 
of a duplex and then the other, eliminating the necessity for 

* S12Sable Stret ° h ° £ P~i*. or pyrimidines to be 

present on one strand of a duplex. 

2 in instances wherein the anti sense, ribozyme, and/or 

traple helix molecules described herein are utilized to 
inhabit mutant gene expression, it is possible that the 
technique may so efficiently reduce or inhibit the 

; transcription (triple helix) and/cr translation (antisense, 

30 tneT Pr0dUCed ^ n0rmal tar9et allies that 

taLr 8 arlSe Whereln the concentration of normal 

target gene product present may be lower than is necessary 
for a normal phenotype. In such cases, to ensure that 
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substantially normal levels of target gene activity, are 
maintained, therefore, nucleic acid molecules that encode and 
express target gene polypeptides exhibiting normal target 
gene activity may, be introduced into cells via gene -therapy 
5 methods such as those described, below, in Section 5.9.2 that 
do not contain sequences susceptible to whatever antisense, 
ribozyme, or triple helix treatments are toeing utilized. 
Alternatively, in instances whereby the target gene encodes 
an extracellular protein, it may be preferable to co- 
administer normal target gene protein in order to maintain 

10 

the requisite level of target gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple helix 
molecules of the invention may be prepared -by any method 
known in the art for the synthesis of DNA and RNA molecules, 
as discussed above. These include techniques for chemically 

15 synthesizing oligodeoxyribonucleot ides and 

oligoribonucleotides well known in the art ^uch as for. 
example solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and 
in vivo transcripition of DNA sequences encoding t-he antisense 

20 RNA molecule. Such DNA sequences may be incorporated into a 
wide variety of vectors that incorporate suitable RNA 
polymerase promoters such as the T7 or SP6 polymerase 
promoters. Alternatively, antisense cDNA constructs that 
synthesize antisense RNA constitut iyely or inducibly, 

~ ^ depending on the promoter used, can fee introduced stably into 
cell lines . 

-5.4.3.2. GENE REPLACEMENT THERAPY 

Mahogany gene nucleic acid sequences, described above in 
Section 5.1, can be utilized for the treatment of a mammalian 
30 body weight disorders, including obesity, cachexia, and 
anorexia. Such treatment oan be in the form of gene 
replacement therapy. Specifically, one or more ^copies of a 
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normal mahogany gene or a portion of the mahogany gene that 
directs the production of a mahogany gene product exhibiting 
normal mahogany gene function, may be inserted into the 
appropriate cells within a patient, using vectors that 
5 include, but are not limited to adenovirus, adeno-associated 
varus, and retrovirus vectors, in addition to other particles 
that introduce DNA into cells, such as liposomes. 

Because the mahogany gene is expressed in the brain, 1 
such gene replacement therapy techniques should be capable 
iq delivering mahogany gene sequences to these cell types within 
patients. Thus, in one embodiment, techniques that are well 
known to those of skill in the art (see, e.g.,' PC T 
Publication No.' WO89/10134, published April 25, 1988) can be 
used to enable mahogany gene sequences to cross the blood- 
brain barrier readily and to deliver the sequences to cells 
in the brain. With respect to delivery that is capable of 
. crossing the blood-brain barrier, viral vectors such as, for 
example, those described above, are preferable. 

In another embodiment, techniques for delivery involve 
direct administration of such mahogany gene sequences to the 
20 site of the cells in which the mahogany gene sequences are to 
be expressed. 

Additional methods that may be utilized to increase the 
overall level of mahogany gene expression and/or mahogany 
gene product activity include using target homologous 
25 recombination methods, discussed in Section 5.2, above, to 
modify the expression characteristic of an endogenous ' 
mahogany gene in a cell or microorganism by inserting a 
heterologous DNA regulatory element such that the inserted 
regulatory element is operatively linked with the endogenous 
mahogany gene in question. Targeted homologous recombination 
can be thus used to activated transcription of an endogenous 
mahogany gene that is "transcriptionally silent", i.e., is 
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not normally expressed, or to enhance the expression of an 
endogenous mahogany gene that is normally expressed. 

Further, the overall level of mahogany gene -expression 
arid/or mahogany gene product activity, may be increased by the 
5 introduction of appropriate mahogany-expressing cells/ 

preferably autologous cells, into a patient' at positions .and 
in numbers that are sufficient to ameliorate body weight 
disorder symptoms. Such cells may be either recombinant or 
non- recombinant . 

10 Among the cells that, can be administered to increase the 

overall level of mahogany gene expression in a patient are 
normal cells, preferably brain cells, that express the 
mahogany gene. Alternatively, cells, preferably autologous 
cells, can be engineered to express mahogany gene sequences, 
and . may then be introduced into a patient in positions 

!5 appropriate for the amelioration of the body weight disorder 
symptoms. Alternately, cells that express an unimpaired 
mahogany gene and that are from a MHC matched individual can 
be utilized, and may ^include, for example, brain cells.. The 
expression of the mahogany gene sequences is controlled by 

2o the appropriate gene regulatory sequences to allow such 

expression in the necessary cell types. Such gene regulatory 
sequences are well known to the skilled artisan. Such cell- 
based gene therapy techniques are well known to those skilled 
in the art, see, e.g., Anderson, U.S. Patent No. S, 399, 349. 

25 When the cells to "be administered are non -autologous 

cells, they can be administered using well known techniques 
that prevent a host immune response agaiinst the introduced 
cells from developing. For example, the cells may be 
introduced in an encapsulated form -which, while allowing for 
an exchange of components with the immediate extracellular 

30 environment, does not allow the introduced cells to be 
recognized by the host immune system. 
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Additionally, compounds, such as those identified VJL „ 
techniques such as those described, above, in Section 5 4 2 
that are capable of modulating mahogany gene product activity 
can be administered using standard techniques that are well 
5 known to those of skill in the art. m instances in which 
the compounds to be administered are to involve an 
interaction with brain cells, the administration techniques 
should include well known ones that allow for a crossing of 
the blood-brain barrier. 

10 

5 . 5 . PHARMACEUTICAL PREPARATIONS AND 
METHODS OF A DMINISTRATION 

The compounds that are determined to affect mahogany 
gene expression or gene product activity can be administered 
to a patient at therapeutically effective doses to treat or 
15 ameliorate body weight disorders, such as obesity, anorexia 
or cachexia. A therapeutically effective dose refers to that 
amount of the compound sufficient to result in amelioration ' 
of symptoms of such a disorder. 

2 0 5.5.1. EFFECTIVE DOSR 

Toxicity and therapeutic efficacy of such compounds can 
be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., for determining the 
LD 50 (the dose lethal to 50% of the population) and the- ED S0 ' 
(the dose therapeutically effective in 50% of the 
population) . The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as V 
the ratio LD 50 /ED 50 . Compounds that exhibit large therapeutic 
andices are preferred. While compounds that exhibit toxic 
sade effects may be used, care should be taken to design a 
30 delivery system that targets such compounds to the site of 
affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects 



LD 

25 
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The data obtained from the cell culture assays and 
animal studies can be used in formulating a range of dosage 
for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that 
5 include the ED 50 with little or no toxicity. The dosage may 
vary within this range depending upon the -dosage form 
employed and the route of administration utilized. For any 
compound used in the method of the invention, the 
therapeutically effective dose can be estimated initially 
10 frOTTV cel1 culture assays. A dose may be formulated in animal 
models to achieve a circulating plasma concentration range 
that includes the IC 50 (i.e., the concentration of the test 
compound that achieves a .half -maximal inhibition of symptoms) 
as determined in cell culture. Such information can be used 
to more accurately determine useful doses in humans. Levels 
15 in plasma may be measured , for example , . by high performance- 
liquid chromatography. 

5,5.2. FORMUIi AT I ONS AND USE 

Pharmaceutical compositions for use in accordance with 
2 0 the present invention may be formulated in conventional 

manner using one or more physiologically acceptable carriers 
or excipients . 

Thus, the compounds and their physiologically acceptable 
salts and solvates may be formulated for administration by 

25 inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

For oral administration, the -pharmaceutical compositions 
may take the form of , . f or example, tablets or capsules 
prepared by conventional means with pharmaceutical ly 
acceptable excipients such as binding agents (e.g., 

30 pregelatinised maize starch, polyvinylpyrrolidone or 
hydroxypropyl methylcellulose) ; fillers (e.g., lactose, 
micrbcrystalline cellulose or calcium ^hydrogen phosphate) ; 
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lubricants (e.g., magnesium stearate, talc or silica, ■ 
disintegrants (e.g., potato starch or sodium starch 
9lycolate ); or wetting a 9 e„ts (e.g., sodlum lauryJ sulphate) . 
The tablets may be coated by methods well known in the art 
5 Liguxd preparations for oral administration may take the form 
Of. for example, solutions, syrups or suspensions, or they 
may be presented as a dry product for constitution with water 
or other suitable vehicle before use. Such liquid 
preparations may be prepared by conventional means with 
10 pharmaceutical^ acceptable additives such as suspending 
agents (e.g.. sorbitol syrup, cellulose derivatives or 
hydrogenated edible fats,, emulsifying agents (e.g.. lecithin 
or acada); non-aqueous vehicles (e.g., almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils, , and 
^ preservatives (e.g.. methyl or propyl -p-hydroxybenzoates or 
sorbic acid,. The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as 
appropriate. 

Preparations for oral administration may be suitably 
formulated to give controlled release of the active compounds 

form I t T <*• compositions may take the 

form of tablets or lozenges formulated in conventional 



manner 



For administration by inhalation, the compounds for use 
according to the present invention are conveniently delivered 
25 in the form of an aerosol spray presentation from pressurized 
packs or ^nebulizer, with the use of a suitable propellant. 
e.g. , dichlorodif luoromethane. trichlorof luoromethane 
dichlorotetrafluoroethane. carbon dioxide or other suitable ' 
gas in the case of a pressurized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 
amount capsules and cartridges of e.g.. gelatin for use in 
an inhaler, or insuf f Jator may be formulated containing a 



gas 

30 



79 - 



BNSDOCID: <WO 0OOS373A3 IA> 



WO 00/05373 



PCT/US99/J6484 



powder mix of the compound and a suitable powder base such as 
lactose or starch. 

The compounds may be formulated for parenteral 
administration by injection, e.g., by bolus injection or 
5 continuous infusion. Formulations for injection may be 

presented in unit dosage form, e.g., in ampoules or in multi- 
dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain 
10 formulatory a 9 ents such a s suspending, stabilizing and/or 

dispersing agents. Alternatively, the active ingredient may, 
be in powder form for constitution with a suitable vehicle, 
e.g. , sterile pyrogen- f ree water, before use . 

The compounds may also be formulated in rectal 
compositions such as suppositories or retention enemas, e.g., 

15 

containing conventional suppository basses such as cocoa 
butter or other glycerides. 

In addition to the formulations described previously, 
the compounds may also be formulated as a depot preparation. 
Such long acting formulations may be administered by 

20 implantation (for example subcutaneously or intramuscularly) 
or by intramuscular injection. Thus, for example, the 
compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an -emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly 

25 soluble derivatives, for example, as a sparingly soluble 
salt . 

The compositions may, if desired, be presented in a pack 
or dispenser device that may contain one or more unit dosage 
forms containing the active ingredient. The pack may for 
example comprise metal or plastic foil, such as a blister 
30 pack. The pack or dispenser device may be accompanied -by 
instructions for administration. 
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6 * EXAMPLE : GENETIC AND PHYSICAL MAPPING 

OF THF: m * w OGANY T.QCUfi 

In the Example presented herein studio 
which fircr „- ' studies are described 

e. First, LDJ-Le-mg mice were cr with CAST /Ei „ 

The Fls were back-crossed with LDJ L( . T/E ' mCe ' 

" —ltin 9 litters scored £ o ^ ' ^ 

*olor of; homozygotes were genot H ~ at 

showing recombinant events we^f • 

various « rkers shown ^ ™ — 

15 performed using pcr SSL p / 9 en °'ypin 9 was 

After 2300 e " analyZe<S USi "9 PAaE - 

er 2300 ^eiosesis i-ho m ^ 

0.» =» interval FIG . This ^°** nJ ' ^ - * ""^ *> • 

width of 700 Kb. P ° nded £ ° « i«erva! 

25 7. EXAMPLE: IDENTIFICATION OP A CANDIDATE 

— MAHOGANY GE*ns? 



which corresponds eo , «• 

Clones spanning the 700WH >-^.- 
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novel sequences encoding proteins are located in this 
integral, see the bottom of FIG. 1. With each open reading 
frame identified, mutational analysis, primarily via SSCP 
analysis, was used with the three alleles of the mahogany 
phenotype mace to identify which of the open reading frames 
5 within this, interval contain a mutation in an mg mouse. 
A mutation was found in one of the genomic/cDNA 
sequences found in the integral in mg3J mice. Figures 3 and 
..2 provide the genomic and cDNA sequences surrounding the 
mutation, FIG. 6 shows the mutation in mg3J, and FI-GS . 8 and 
10 5 show splice variants in the 5' end of the murine mg -gene . 
- The mutation in mg3J mice is a deletion of a . GCTGC sequence 
which results in the creation of a frameshift. -Based on the 
chromosomal location and mutation identification/ the cDNA 
provided in Figure 2 and the corresponding genomic DNA which 
contains the contigs provided in Figure 3 represent the mg 

15 

gene/locus . 

Further analysis of cDNA clones identified two distinct 
splice variants in the 5* end of the mg gene. Figure 7 
provides an analysis of the structure of the two splice 
variants, denoted akm!003 and akm!004 . Figures 8 and 9 

2 0 provide the nucleic acid and amino acid sequence of the 5 1 
ends of these splice variants and structural analysis of the 
protein encoded by the 5 1 regions. 

Analysis of libraries of human cDNA sequences led to the 
identification of three forms of the human ortholog of the mg. 

25 gene: a long form (FPGS. 18A-D) and two shorter splice 
forms, each of which is shown in FIGS. 19A-D and 20A--C. 

8. EXAMPLE: CHARACTERIZATION OF 

THE MAHOGANY GENE 

In the example presented herein, the nucleic acid 

3 0 

sequence of. the mahogany gene ; transcript iclent if ie-d in the 
example presented in Section 7, above, is used to -generate 
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Northern analysis data which characterize th. 

5.-8 3 ene b ein 9 the mano 3 an y sene ' C ° Ml "" t «» 

For Northern analysis, polyA rna was isolate f 

. protocols. Northern blots prepared from t-H 
v. w ■ , . H^pdrea rrom this mRNA wa-Q 

^hybridized with a probe obtained from sequencesTl" , ' 
akml003 and akm!004 sequences q „'- r q " enCSS Common to the 
TTCCTCACrcr * ^ eqUences - Specifically PC R primers 
11 CCTCACTGG and GGACACACAG were u<?i*r? «-« -, • ^ 
^003 sequence „ hich had b J ^ £ «- 

■ 5 An mg transcript was found in all m-i ~> 

seen in ^ "* *> expression was 



20 mice L N ° r h ^. bl - ™» on RNA samples f rom mahogany 
20 •««, the m g transcript was found to be expressed at a 

reduced level in all tissues in mRNA isolated from L 3J . 

• W. in ° rl9inal "9 —ant mice derived 

25 These results are consistent with ,-h. 

5 herein as being Mahogany gene". "* ^ 

9. EXAMPLE : EFFECTS OF THE MAHOGANY GENE 

on grnfttp arm nimpv „ PPe , TTY 

SpecificaUy. these e xperl melts te „ c ~ ^ ■ . 
the ooesity of other ^enic ohese mutants Z Z^T^ 
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whether it can suppress diet-induced -obesity. The results 
show that mg does not suppress obesity , in any of the 
monogenic obese mutants. However, mg can suppress diet - 
induced obesity. Thus, the gene and its corresponding 
5 gene product and compounds that modulate mg expression and/or 
activity have implications in the treatment of diet- induced 
obesity disorders,' as well as in the treatment of disorders 
related directly to the mg or agouti gene. 

0 9 • 1 • . MATERIALS AND METHODS 

Genetic crosses : The crosses, and the number of -animals 
for each (n) were (LDJ /Le-mg/mg X CAST/Ei) X LDJ/Le-mg/zng 
(n=1588), ^HeB/FeJ-zng"/ mg 3 ' X CAST/Ei ) X C3HeB/Fe J-zng^/ .mg"' 
(n=324), (CSHeB/FeJ-mg 3 - 3 / mg 3 ' X MOLF/Ei) X C3HeB /Fe J - mg 3 ' / mg 3 ' 
^ (n=216) and (C3HeB/FeJ-mg 3J / mg 3 ' X C57BL6/J) X C3HeB/F*J^V 
mg 3 .' (n=309) . The 2437 N 2 mice were analysed by coat colour to 
determine their genotype at the mg locus. As mi-ce change 
color slightly at each hair molt and because the phenotype of 
mg/mg vs. mg/ + can be subtle, all mice were phenotyped at the 
same age by a single person. Genomic DNA was made from a 
tail biopsy of each mouse and analysed for multiple simple 
sequence length repeat polymorphism 4SSLP) markers. The first 
-100 mice were typed for a series of polymorphic Mit 
genetic markers (Deitrich, W. F. et al . , 1996, Jtfature 3fiO:149- 
152) from distal mouse chromosome 2 in order to accurately 
delimit the position of mg. With the first -100 mice it was 
determined that mg mapped approximately lScM proximal of 
Agouti between markers D2Mitl9 and D2Nds3 (FI<3. 13). All 
remaining animals were genotyped for D2Mitl9 and £>2Nds3 . 
Animals recombinant in that interval were typed with ail 
available Mit markers between and for the ever growing number 
of markers developed during the project which, finally 
totaled 265 markers. 
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9 ' 2 ' RESULTS 



The murine mahogany (mg) gene is known tQ ^ & 

dosage dependent manner within the agouti pathway, to 
compensate for the agouti overexpression and for lack of 
5 sagnaling from the nul allele Mclr (Miller, K A et al 

1997, Genetics .,6:1407-1415; Dinulescu, D . M . et al Proc 
Natl, Acad . sci., in press; Robbing L.S. et al . , 1993 Ceij , 
72:827-834) . The phenotype of mice homozygous for both mg 
and a null allele of Mclr (recessive yellow, Mclr*) is 
10 yellow, the same as the phenotype of Mcl^/Mclr* mice 
indicating that mg is not acting downstream of Mclr \ 
«mal«r experiment was performed with obese Mcr 4 Knock out 
T Ce M \ * F ° r b ° th — s, all the animals homozygous 
or „e,,-/- were approximately equally obese and were heavier 
i5 nan- the mice wild-type at Mc<r independent of the genotype 
for m 9 . This data strengthens and confirms the „clr dat^ 
prev.ously published, strongly suggesting that m 9 acts at or 
upstream of both melariocortin receptors. 

To test whether m g acts specifically within the agouti 

pathway, experiments were perform^ " i-« * + ■ 
20 periormed to determine whether ma 

can suppress the obesity of Qther mcnogenic 9 

the mouse and wh.ther it- cou ld suppress diet-induced obesity 
Appropr.ate 9 enetic crosses were set up to product mice 
segregating » 3 and one of the mouse obesity mutations Cpe". 

2s *' Pr leP SUCh a " homozygous and ' 

25 heterozygous animals were on the same mix of genetic 

background. Nc suppression of obesity was seen for any of 
the monogenic obese mutants , P10 . „, lending cr 
assumed^ specificity of .action within the agouti pathway. T o 
ask whether mg can suppress diet induced obesity C3HeB/FeJ- 
30 mg- and C3H/HeJ mice were placed, at weaning, either on 
normal chow having a physioiogical fuel value ,ppv, of 3.63 

?:/r r th 9% fat - or ° nt ° a hi9h ^ . ™ «* 

4.53 kcai/gm with 42.2* fat. Pood consumption and body 
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weight were measured weekly. Converting the grams of food 
consumed to calories indicated that C3H/HeJ' mice on normal 
chow and high fat diet consumed -97 kCal/week and -36 
kCal/week, respectively. C3HeB/Fe J-zngr 3 * 7 mice on normal chow 
and high fat diet consumed '-83 kCal/week and -61 kCal/week, 
respectively. Despite the equal calorie: intake, the C3H/HeJ 
mice on the high fat diet readily gained more weight than the 
C3H/HeJ mice on normal chow <p=0.0004). In stark contrast, 
the C^HeB/FeJ-mgr 3 - 7 mice on "either diet showed no statistically 
significant difference in weight (FI*G . 12D) . Female data 
showed the same trends, although there was no statistical 
significance between any of the mice on either diet. 

io. experiment;: mapping and sequencing 

OF THE MAHOGANY GENE 

This section describes experiments wherein the murine 
mahogany gene was genetically and physically mapped to an 
approximately 0 . 6 cM interval, and then sequenced. The 
murine mg sequence obtained was then used to isolate! and 
sequence the human mg gene. Northern and in situ analyses of 
mg expression in mouse tissue are also described, and 
sequence motifs of the predicted MG polypeptide are 
discussed . 

10.1. MATERIALS AND METHODS 

Physical Mapping: More than 36 , 000 individual sequences 
from the region were compared by BLAST (Altschul, S.F. ^st 
al. t 1990, J. Mol. Biol. 215:403-410) to publicly available 
sequence databases and analyzed using GRAIL «*uan, X. et al , 
1992, Proc. Eighth IEEE Conference on Al Applications z 9-13) 
to identify potential coding sequence. In addition, 
sequences from overlapping BACs were assembled using phrap 
(Sing, C.F. et al . , 199,8, Genome Res. 8:175-185; Ewing B. arid 
Green, P . , 1998, Genome Res . 8 : 1&<6 -194; Gordon , D . iet al 
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1998, Genome Res. 8 • l9^^onr>\ : - 

their relationships to each other. 

10 RNA was' extracted from 

the tissues mdicated £rom wild-type, C3H/Hea and the three 
mutant alleles of mg, C3HeB/FeJ-mg« LDJ/Lp ' 66 

^ j ' LDJ / Le -™^ and C3H/HeJ- 

mg". according to the manufacturers 

60 (Tel T^r t S lnstr ^ tl ons. RNA STAT- 

(Tel-Test, me, 1511 Sounty Rd 129 pw. V 
„r,,, ... y Q * - L ^ 9 ' Friendswood, TX 

' Gaither sberg, MD) . Filters wer P 
washed with O.llx SSC, 0 1% SDS ar , H ^ 

film overnight. •" P °" d to KODAK X-omat 

10.2 . RESULTS 

25 the A P ° Siti0nal Cl ° nin9 ^tegy was undertaken to identify 
the mg gene. Multiple genet ic crosses were set un ^ 

second generation mice (n-2437) seo. ? 

^ 2437) segregating mg which were 

used to genetically localise the mn t , 

y anse the mg locus (FIG 13B) . when 
the genetic map critical interval for mn 

_ 0 , w , . , nerval for was resolved to 

-0.6 CM physical mapping was initial 

into random shea^d Ubraries for S L ""*' 
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coverage across the interval had been achieved -and that all 
genes within the region had been found. Twenty-nine genes 
were identified, 15 of which are novel genes. Within the 
final minimal interval for mg , indicated by the arrows in 
5 FIG. 13, there were eleven genes of which nine were unknown. 
All of these genes were tested as candidates for mg by 
examining the three mutant alleles of the mahogany locus, *he 
original allele, mg , that arose in a stock of Swiss x C3H 
mice, and two - alleles that have independent ly arisen on the 
io C3H background, C3HeB/FeJ-/ng^ J //ng JJ and C3H/He-/n^//ng i . Each 
gene was examined by Northern blot analysis and RT-PCR 
analysis, of RNA from tissues from wild- type and' -mg mutant 
mice, by Southern blot analysis of DNA from wild-type and mg 
mutant mice, and by SSCP analysis of genomic PCR products 
designed to cover the intron-exon boundaries of much of each 
15 of the genes. In all, 20 genes* were analyzed in this manner, 
one of which, showed a northern blot difference between the 
wild type and mutant alleles (FIG. 14). 

The wild type expression pattern of this gene gives 
three bands of size ~9 kb, 4.5 kb, and 3.8 kb, of which the 
20 larges message is the most prominent {FIG 14) . The smaller 
two bands can be seen in all tissues but , depending upon 
tissue, may require extended exposure. Each of the different 
mg alleles gave a different expression pattern. C3HeB/FeJ- 
mg 3J /mg 3J has extremely low expression, the 9 kb message only 
25 being very faint in brain, hypothalamus, and fat on 

northerns. C3H/ He -mg L /mg L expresses a single aberrant band of 
approximately 9.5-10 kb in kidney, heart, muscle, fat, and, 
most prominently, brain and hypothalamus. The LDJ/Le-mg/mg 
shows an altered ratio of the three wild type, messages : the 
• 9 kb message is reduced, while the two smaller messages are 
more highly expressed, in particular feeing very abundant in 
fat and hypothalamus. In situ analysis was used to look more 
closely at mg expression in £he 4>rain and specifrcally the 
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5 alleles revealed difference, in th 

have reduced vhh expression (PIG. 15) which is 
.interesting as many neuropeptides and rtclorsT " 15, 
involved in body Weight regulation eCePt ° rS kno ™ '» ^ 

10 including ttc4r. 9 °" are in the VHH. 

... initially, two overlapping mouse cDNAs of 1051 k / 

2«S bps were identified Usin 0 fk bpS a " d 

_ ^iec. using these cDNAs aq =, ^ • 

Poant it was possible to build over 7990 bos , ! Sta ^S 
sequences, using both the public EST dlab 

database, as well as idP n, f database and an in house 

identafymg one cDNA clone fr nm „ . 
liver library. The 23 ESTs used in fh a human 

in Table I below n • ! contiging are listed 

then pos S ibi ::; S t rL": derived ~, it was 

-e mouse ^ ^^^ 
amplication and syncing .^ZT.^: 1 - ^ 
20 seguence was obtained, of which 40 li bp'are coa ! ^ 
The mouse genomic 3 r,o„ c coding sequence. 

identified ^ kb ' *» d »>«s 31 

.identified exona, at least one of which is 

spliced. 1S dlf ferentially 



5 TABLE I 

gene Bank ftrrossion # Clone_iD_ft clon 



15 



NA 

AA062169 
NA 

AA350292 
R87660 



e Sourne 

NA 



Human Endothelial Cell 
(MPI) 



482948 
NA 



Soares mouse P3NMF19 . 5 
Human Liver (MPI) 
151062 infant Brain 



194640 



J ^fII FeCal Wver 
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T69367 82898 
T92696 118881 



Strataoene Liver 
Stratagene Lung 
Soares Infant Brain 1 NIB 



H11351 47626 

AA350293 15 1062 infant Brain 

AA297697 149 184 Eetal Heart II 

AB011120 - ■■ NA Human Nale Brain 

AA297214 129808 Embryo, 12 week I 

AA298732 184690 T-Lymphocyte 

AI076479 1676623 



Soares Total Fetus Nb2HF8 
10 9W 

AA771958 / 1359202 Scares parathyroid tumor 

NbHPA 

R84298 . . 194640 Soares Fetal Liver Spleen 

1NFLS 

081046 1178923 Human Fetal Brain 

15 (Tfujiwara) 

AA3 78 6 03 183010 Synovial Sarcoma 

060710 962349 Clohtech Human Fetal 

Brain (#6535) 

020236 pml235 Human Promyelocyte 

AA345684 147210 Gall Bladder I 

20 H45413 182870 Soares Breast 3NbHBst 

AA044305 486349 Soares Pregnant Uterus 

NbHPu 

. The mutant mahogany alleles were also sequenced, 
checking all intron-exon boundaries. A 5 bp deletion at 2S09 
nt was found in the coding sequence of the mg gene from 
C3IteB/FeJ-im^/ n *» which introduces a stop codon a position 
937, two codons 3- of the deletion. This mutation will 
result in a seriously truncated protein lacking many 
interesting domains, as discussed below. The mg" allele is 
30 the same allele that showed extremely low expression levels 
The combined Northern blot analysis, in situ hybridization 
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analysis, and sequence analysis of the mutant mg^ allele 
strongly suggest that this gene is the mouse mahogany gene. 

The 4011 bp of open reading frame (ORF). of mouse MC- 
predicts : a 1336 ammo acid polypeptide with molecular mass of 
148.706. D. (FIGS. 17A-D, top sequence). BLAST searches of the 
NCBI.and SwissProt protein databases identified two human 
paralogues with a similar modular architecture (KIAA0534 
Genbank accession no. 3043592; and MEGFS , Genbank accession 
no. AB011541), as well as a C. elegans homologue (YC8:_CAEEL, 
Genbank accession no. 019981). 

10 Another human protein, Attractin or DPPT-L (Duke-Cohen 

J-S. et.ai., 1998f Proc , Nat2 _ Acad _ sc .. USA 55;11336 _ ' 

11341), appears to be a 1198 amino acid residue 
approximately 134,000 D, secreted splice variant of the mg 
polypeptide. An alignment of the predicted MG (top) and 

1S Attractin (bottom) amino acid sequences is shown in FIGS 

17A-D. Attractin has not identified as being involved in the - 
regulation of body weight, Rather, the protein is reported 
to mediate an interaction between T lymphocytes and monocytes 
that leads to the adherence and spreading of monocytes that 

2o become foci for T lymphocyte clustering (see Duke -Cohen et ' 
a 1 . , supra) . 

Searching the MG polypeptide with the SMART domain tool 
(Schultz, J. et al., 1998, Proc . Natl. Acad. Sci . U.S. A 
55:5857-5864) revealed sequence motifs that may provide 
further clues to its biological function (FIG. 16B, FIGS 
25 17A-D). The single transmembrane spanning MG protein has a 
large extracellular sequence of 1289 amino acids containing 
three EGF domains (Nakayama, M. et aL, 1998, Genomics 51-27- 
34), two laminin-like EGF repeats, a CUB domain (Bork, P and 
Beckmann, G. , 1993, Mol . Biol . 231:539-545), a C-type lectin- 
30 domain (Drickamer, K. , 1995, Na t . Struct .Biol . 6:437-439- 
Weis W. I., and Driekamer, K. , 1996, Ann . Rev. BiocW 
65:441-473), two plexin-like repeats (Maestrini, E. et al . , 

. - 91 - 



BNSDOCID: «WO 0005373A3 IA> 



SUBSTITUTE SHEET (RULE 26) 



WO 00/05373 



PCT/US99/16484 



1996, Proc . Natl. Acad. Sci . U.S.A. 93:674-678) , and six 
consecutive kelch repeats (Bork, P. and Doolittle, R.F., 
1994, J. Mol . Biol. 236 : 1277 - 1282) . Multiple EGF domains are 
commonly found in Type-1 membrane proteins involved in cell 
5 adhesion and receptor- ligand interactions (Schultz, J. et al , 
1998, Proc. Natl. Acad. Sci. USA 55:5857-5864). Laminin-EIGF- 
like modules are found in a variety of proteoglycans such- as 
perlecan and heparin sulphate proteoglycan. As CUB domains 
also frequently occur in glycosylated proteins and c-type 
lectins are known to be carbohydrate binders, it is likely 

10 

that MG is heavily glycosylated and -that carbohydrate 
interactions are essential for its function. Many kelch 
motif containing proteins have been found that, like MG, liave 
multiple consecutive domains. Such consecutive four-stranded 
P-sheet Kelch motifs form a bladed beta "propeller fold" that 

15 is common in many sialidases and other enzymes (Maestrini, E. 
et al . , supra) . Unlike the other well recognized domains, 
the "plexin" repeat is less well defined. It was first 
recognized as a triple repeat in the Xenopus gene plexin that 
has similarity to MET (Bork, P. and Beckmann, G., 1993, Mol . 

20 Biol. 232:539-545). Since then, this cysteine rich repeat 

has been found in 6 MET gene family members, three of which 

signal via tyrosine kinase and three of which are 

hypothesized to have putative signaling function via a novel 

conserved cytoplasmic domain. However, it is fascinating 

_ that there is an eight amino acid stretch that i*s 100% 
25 

conserved in the four proteins shown in FI*G 16A from human, 
mouse, and C. elegans . The conservation of sequence across 
such widely evolutionary divergent species strongly indicates 
a functional domain, possible a putative signaling motif. 

The multi -domain structure of MG is complex, but draws 

30 

many similarities from receptor and receptor-like proteins. 
The full-length MG polypeptide is predicted to toe a large 
membrane -spanning protein with multiple .extracellular domains 
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that may have a binding or gathering function as well as a 
highly conserved putative signaling motif in the cytoplasmic 



tail 



5 

The present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 
as single illustrations of individual aspects of the 
invention. Functionally equivalent methods and components 
io are within the scope of the present invention. Indeed 

various modifications of the invention, in addition to those 
shown and described herein, will become apparent to those 
skilled. in the art from the foregoing description and 
accompanying .drawings. 

All publications and patent applications mentioned in 
the specification are herein incorporated by reference to the 
same extent as if each individual publication or patent 
application was specifically and individually indicated to be 
incorporated by reference. 



20 



25 



30 
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WHAT IS CLAIMED. IS : 

1 . An isolated nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NO:' 1 (FIG. 2A) , SEQ ID NO: 8 

' (FIGS. 8A-C) , SEQ ID NO: 10 (FIG. 9), SEQ ID NO: 12. (FIG. 
'ID), SEQ ID NO: 14 (FIG. 18A), SEQ ID NO: 16 (FIGS. 19A-C), 
'5 or SEQ ID NO: 18 (FIG. 20A-B). 

2. The isolated nucleic acid molecule of Claim 1, 
..wherein the nucleic acid molecule comprises the nucleotide 

sequence of SEQ ID NO: 1 (FIG. 2A) . 

10 

3. The isolated nucleic acid molecule of Claim 1, 
wherein the nucleic acid molecule, comprises the nucleotide' 
sequence of SEQ ID NO: 6 (FIGS. 8A-C)- . 

4. The isolated nucleic acid molecule of Claim 1, 

15 

wherein the nucleic acid molecule comprises the nucleotide 
sequence of SEQ ID NO: 10 (FIG. 9) ."' 

5. The isolated nucleic acid molecule of Claim 1, 
wherein the nucleic acid molecule -comprises the nucleotide 

20 sequence of SEQ. ID NO: 12 (FIG.. TO) . 

6. The isolated nucleic acid molecule of. Claim. 1, 
wherein the nucleic acid molecule comprises the nucleotide 
sequence of SEQ ID NO: 14 (FIC ISA). 

25 

7. The isolated nucleic acid molecule of Claim. 1, 
wherein the nucleic acid molecule comprises the nucleotide 
sequence of SEQ ID NO: 16 (FIGS. 19A-C) . 

8. The isolated nucleic acid molecule of Claim 1, 

30 

wherein the nucleic acid molecule comprises t-he nucleotide 
sequence of SEQ ID NO: 18 {FIG. 20A--B) . 
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•■ 9. A vector compnsing the isolated nucleic acid 
molecule of any one of. Claims 1-8. 

10. An isolated host cell genetically engineered to 
express the nucleic acid of any one of Claims 1-8. 

11- An isolated nucleic acid molecule comprising a ~ 
nucleotide sequence that hybridizes, to" the . complement of SEQ 
10 ID NO: 1 (FIG . 2A), SEQ ID NO: 8 (FIGS. 8A-C), SEQ IB NO • 1 C 
(FIG. 9), SE0 ID NO: 12 (FIG. 10), SEQ ID NO : 14 ( FIG 18A) 
SEQ ID -NO: 16 (FIGS . 19A-C), or SEQ ID ' NO : 1 8 (FIG . 20A-B) ' 
under stringent conditions comprising hybridization in 0 5 M 
NaHPO«, 7% SDS, 1 mM EDTA at 68 °C. 

15 

12. A vector comprising the isolated nucleic acid 
molecule Claim ll. 

.13. An isolated host cell genetically engineered to 
express the nucleic acid of Claim 11 

20 

14. A method of producing a mg gene product comprising 
cultunng the genetically engineered host cell of claim 10 so 
that the mg gene product is expressed in cell culture/ and 
recovering the mg gene product from the cell culture. 

25 

15. A method of producing a mg gene product comprising 
culturing the genetically engineered host cell of Claim 14 so 
that the mg gene product is expressed in cell culture, and 
recovering the mg gene product from the cell culture. 

30 

• 16 - An is °lated gene product encoded by the nucleic 
acid molecule of any one: of claims 1-8. 
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■17. The isolated gene product of Claim 16, wherein the 
gene product comprises the amino acid sequence shown in 
Figure 2B (SEQ. ID NC . 2), Figure 8D (SEQ. ID NO. 9), Figure 
9 (SEQ . . ID NO. II), Figure 10B (SEQ. ID NO. 13), Figures lfiB- 
5 D (SEQ. ID NO. 15), Figure 19D (SEQ. ID NO. 17), or Figure 
2 0C (SEQ. ID NO .19). 

18. An antibody that immunospeci fical l.y . binds the gene 
product of Claim 16 . 

0 

19. A method for diagnosing a body weight disorder in a 
mammal, comprising: measuring the level of mg gene 
expression in a patient sample and comparing .the level to 
that of a control sample, so that if a difference between the 
level's is detected, a body weight disorder 'is diagnosed. 

'20. A method for diagnosing a body weight disorder in a 
mammal, comprising detecting a mg gene mutation contained in 
the genome of the mammal that correlates with presence of the 
disorder. 

) 

21. A method for diagnosing a body weight disorder in a. 
mammal, comprising: measuring the Level of mg activity in a., 
patient sample and comparing the Level to that of a control 
sample, so that if a difference between the levels is 

> detected, a body weight disorder is diagnosed. 

22. A method for identifying a compound that modulates 
mg activity, comprising: 

a. contacting a compound to a cell that expresses a mg 
^ gene ; 

b. measuring the level of mg gene --expression in the 
cell ; and 
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c comparing the level obtained in (b) to mg gene 
expression level obtained in the absence of the 

compound ; 

such that if the level obtained in (b) differs- from that 
5 obtaaned in the absence of the compound, a compound that 
modulates a mg activity is identified. 

23. A method for identifying a compound that modulates 
a mg activity, comprising: 

io a. contacting a compound to a cell that contains a mg 

polypeptide; 

b. ■ measuring the level of mg polypeptide or activity 
in the cell ; and 

c comparing the level obtained in (b) to the level of 
15 ^Polypeptide or activity obtained in the absence 

of the compound; 
such that if the level obtained in (b) differs from that 
obtained in the absence of the compound, a compound that 
modulates a mg activity is identified. 

20 24. The method of Claim 22 or 23 wherein- the compound 

identified is capable of treating a body weight disorder. 

25. A Pharmaceutical composition comprising the 
compound identified by the method of claim 24. 

25 

26. The use of the pharmaceutical composition of Claim 



25 for treating a body weight di 



sorder in a mammal. 



27. The use of the antibody of claim 18 for treating a 
body weight disorder in a mammal 

30 

28. The use of a mg antisense, ribozyme or triple helix 
molecule for treating a body weight disorder in a mammal. 
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AGATTTTATGCCTTCGTACACGCCTCCCATAAGAT6GACAAG6TGTACTA 

ATTACTGCCA™CTGTTGCTGACCCCAGA€GTCAATGTCCTCACAT<3GC 

CTCTACTGGCACTGTCTGGGCAGAAACTGTATATCCAACT-GGTtaAACCTG 

AAAGCCCTATGACTACTTGGTGTCTCTGGTGCTAACCCTAGTCGTTGGGG 

CATCTTACTGTATCCTGGTAAGGAAAGACATCCAGGCTCCCCACTTAYMK . 

WWAC YRGYWMRGMYC AKGS YMGRGC YAAWKTKCTGTRRMRTCTGGCT4BGC 

ATAGAGACATTACTATTGAMGTTTTGTCTTTCTAAATCCTTGGACTAAA 

GAGAGCACAAGATTTTCTGGAAGATCTTGCTTTAAAI I I 1 1 I 1 1 I IATTC 

TTTTGAGATGCTACATATAATTAGAGGCCCTCCACATGGAGGCGAGAAGC 

CCACCTCTGGGCTACATCCTACGTCTTTTCCTTAGGGTA1 H I I I I I ICT 

TTCTTGTACCTATCAGTATTACTAAGTTGCAAATGTGCTCAGCAGTAAAT 

TTAACATACATAGGCAAAAAGAAAAGTCTCAGGACACCCTGCCTCACACT 

GTTTACTGTGCTCAGGAGTACTGAGCCATACTGTTTTCTTGCTGCTGCTT 

TTTTTCTCTTCGTTGTTTACACACACT^^ 

TAGTATnCMTTTTTTCTTAGGTCAGCAAGAMGCTCACAGAGGAAGAG 
TGCmGCTCCCAGCCTGATGACCT^GTGACCCAAGTGATCTCACCTAC 
AGGGTG^GAGCACAGCACAGCATTCCAACTGTTTTTCTGAGCACACAGGC 
ACTATGGCACACAAACACACA<3GATACATAMTGTTAAAAAAAAAAAAAG 
ACTmATATTmCTCCATATAAmAAAAGAmtTCTTTCAACATTC 

cttttgcaaa<£agtatcattgtgtto 
tt<atcttcmwctaaattttta<^^ 

FIG.3B(1) 
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TACTACTTTCTCTAGTAAACTCTCCTTTCATATTACACATCGCTCTCCTG 

TCACCTGTTITAGAGCTGTCATCCATTTTATAAGGTTACTTCACTGTTCT 

ACACTACT7TGTGTCTT7TAATTACTATGCCTGGGGTGATTCAAAAACTC 

TCTGTGATGGGTTGGTTGAGGATGGCTCCAATAGGTTCAACACTTGGTCC 

TGATTGGTGGMCTGTTTTGGGAAGGATTAGGAGGTGTGACCTTGTGGGG 

GAGTGTGTCACTGGGAGTGAGTGACCTTTGAGGTTTCAAAAGCCCATGCT 

AGGCCCAGTGTCTGTCTGCCTGTCTGTCTGTCTCCTCCCTTTGCTCTTTC 

TTCCCTCCCACTTGGTTGCAGATCAGATTCGAGCTCTTAGCTACTGCTCC 

CGTGCTGTGCCTTGCTGCTACCATGCTTCTTGCCATGATGTTCATAGACT 

TACTCTCTGAMCTGTAAATMGCCCCCTMTAAAATGCTTTCTTTTAAA 

ACTGCCTTGATCATGGTGTCTCTTCAAAGAAATAGAACATTAACAAAAAC 

ACTATACCAAACTGCCTAATAGTCCTACTAATTTTATGATGAGTGCTAGT 

GCTTTATMTCACTAGMGAAAAMTTTCCAGGCCATAAAATTAACATGG 

TTTTMGTATGTATAAATCTTGTCTTGAAATCTGTTTFCTATAACTAACT 

CTMTATGATAATGTATATTCTACCTTCAAAAAAGCACAAATAAGACTTC 

AAACCCTGGGAATTGTTAGACAAAGGCCATTTAATACTAATAAGCTATAA 

ACTGAAACCATCTGATATATGAAAACTATTAATAAAATCAAGATAAAATA 

ACCCCTATTTATATAACTTACTATATACCTAAAGCAAAATATCAAAGAAA 

GTACCTTAAAMGATAAATTATTCTTATTTTGACAATCMTTCTTTGGG^ 

CGTTAAATTGTAGAATATCAACACATATCAAGAAAGTTTAGAAGAAAACT 

ACCAMGTTTAMCAGACT7TCCTCGGTMTTACTGGTGATTTCTTGGCT 

FIG.3B(2) 
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I I I i I I I I I I ACACTCCAGTTTTTCAGGGT-GGAAACTTAAGCTTTGTACA 
GAAGCACTTACCACCACTCTCAGAGCT6GAAATGGCTCAAAGGGCAAAGC 
ATTACAAGCCTGGCAACCTGAACCAAATACCCAAAACACTTGCAAAGGTG 
AAAGGAGAAAACTAACTCCAGGAAGTTGTCCTTCGAGCTGCTCTTGCACA 
CCACTGTATACACCCCCTTATATACACTCAGTTAGCATAAATAAAATGTT 
TCATTATAAAGACACTTACGCTAAAACCATGCTGTAATCTGAATGGTTGA 
ACATATATCCGGCAACAACCCACATTATATTTCCATTGACCACAGCTTTA 
TGAGAGGCTCTGGGMGCTTTAAATCAGAATATTCTTCTGGAGTCCAAAA 
AGACTGGTTAGCTGGCACAGGMTTGAGCATCCAGGACCTAATAAAAAAA 
AAAAAAACAACAACAACAACAAATAGCTTCACAAAATGCAGCCTGAAAGT 
mTAGTATTCCAAGTTCCAATCTAAGTGCAAAGAATATTTAAAGACTT-G 
TGGGGCTAGAGAGATGGCTCAGTGGTTAAGAAAACTGACTGCTCTTCTTG 
GAGGTCCTGAGTTCAAATCCCAGCAACTACATGGTGGCTCACAACCATAT 
GTMTGGGGATCTGATGCCCTGTTCTGGT<3T«T<:TGAAGACAGCAACAAT 
GTACTCACATGAAATAMTAMTTMTTTTTTAAAAAACAGACCAGAAAA 
AAAAAAAAAAAAMGACTTGTGTTTCCTTTAGCACTTAAGC-GCAAACATC 
TTTMCTTGTGGi&TTTTAMGGTTTTTA^ 

CATGTATGCCTATATACCACTTGCTTGCTT^TAC(XMTCAT<3TCAGGA 
AMGGCATTGMTCCCCTGGAACTAGAGTTACA<aATCTTATGAGCTACTT 
TGTGGATGCTAGGATCAMGCT^GTCCTCTGGMGAGCAACCAGTACTC 
TTAACCM<3AAGCCATCTCCTTAG^ 

FIG.3B(3) 
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CAAGATACAGACCAAAACCAATCACTCCCTTATAAAATTTAATACTACAC 
ACTTTCTGATMTTTGGCMTTTCTGATMTCAGGTTAMCTTTTTTAGA 
GGTAAAAATCTTGCTGAAGCAACATTTAGTAGAAAGGGTAGACCAAGGGG 
™mTAnMCTCATGTGGAAAAGGCATTAGGGTTGAAATATAATGAC 
AGATCAAAATCGATCTTCTGGCAAGTCCAGGCGCTGAATAGATGAAAGAG 

acaaagggagaattggacaaactaaaaacatttacatgaacacttacttt 
ctgaggacctmgcatagmggaaaatcactaaaccaacgatgactgctt 

CCTCAATACCCCAGGGAATTCCCTACAGTACCTTAGTACGCGGTTGTGTT 

gggtaatggcactagatgacagcactgagactctaaggaacgcttgtcct 

GCTCTCAGCTTGAGTCTCTGCTTCTCTATCACCAGACCATGTTCCCTAAT 

TCCCACGMTGAGTTGCAMGGATTTGTCAAACCTTTCCACAATTCTAAG 

CACATAGATAACAACCACATATATGTAAATTCAAAGAATCTGAATAAATG 

GAGATGAATGCTTAAATGCCACCTGATACATGATTAACATAAGGCGTATG 

GCTGCTAAMTAMCTCCCTACAGTTCACTAACTCAGAACTTTCTGTGAG 

GGAAAGGACTTTGAAGGGCAGCTGCTACCCTGCCAGTGAGGAAAGCAGGA 

GCACCCTCTGGTATCGCTTGCATTACAGATGCCTCGGTGAGGAAAGCCAC 

AGTTGTCTGTACAGTGAGGMTGTGACACGACTCCCCTTTCCAGTTTTCA 

GAACATTCACACTCAACAGCGCTGCTGCTGTTACTGCTCTTACACTCTCC 

TCGGCCTGAGCMTTATTCGGACAGATGTCAAAACTACAAAGACAGGAGA 

AAACGAAGTCAACAATTTCAACTAAGCAACATTGCAACTAATGCAGACCT 

TCCTGCTTCAGTTTMGTTCAGTTCATTT^MGTCTCAGTGGAGGACTT 

FIG.3B(4) 
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accagttagcccaagt6tgctcaca6a6ctctgtgtagctagagccccag 
gctcmgtmtgamtcamtcmccttgctgcattcacatatgaagaag 
gaagaataaataactcacaaagttagagaaattacaaaacaatagacatt 
tgtgcaaaatcacttagacttagctcaagactggcaaccaggat-cctact 
ctttctggtagctcattagtaaagagttctacaaaagcagcaaggtcatg 
ctaggaagtggaggaaggagaggaagccaatgagctgccaacattcacgg 
tatacatttctctgtaaagattctgagmttmcagmtttaagattatt 
ttccagtgatgtagttamggtctttagtmcttttatcagettagaagg 
agmgagcagttaacttcatgtatgagtttaagtgtctcatgacttaaga 
tmcagttngctacmtttgamtgccatacttcagactttttaaaggg 
gtccatogtcgacta™caatagctt/\aamtatagatttctcctact 
gatgattattactgagacactactagtctttattaaattcacttagcaaa 

ACTCCTGACATTTTCTTCCAGCAGCGGAAGAATGTCTCTCTCTTCTAGGA 

wcctcagtgacmgatctagaaagaccaagaactgtggtcccaaccag 

tggggctgatatttgtttmccItttagctcctgtttcttcaattatgm 

aaaaaaaaaaaagaagmgmgaamtccatgttaamtttagcaaggag 

cctgactagctagaagcctgcctcgaatatattagtgttattaagtcatt 

tgagtagtatcacamtattaaatctamtatctocttgtaagtgatat 

tamtccagtcagatotmgcagcatcactgaaaaaai^agcagtgca 

taacctgmgtg^agtgacctcaggagcggtctcattgccat^ 

aggaacmtgagggcaci^aaatgtaaacacagaccaga™ 
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TCAACAGAAACTGTCTATATGTTACTATTT6ATCCTGCTGCTCCTGTTCC 

AACACACACTGTAAATGTGACTCTAGCTGGCCTCAAATTCACAGACCCAC 

CTGCTTCCACCTCCTGGGTTATAGGCATGCGCTACTATGCCCAACATCTA 

AMGGATTTGAAATCTATGACTTTGATTGMTTTnGGTTTTTTGTTT^ 

GCTATAMCTTTTTATTATAATACTCTGAAGTCTCTACAATAACATTATT 

MCAMCTTTATGAATTGACAACTGTCAAATATATACTGTTGAAAGAAAA 

TACTTTACATATTTrTGTMTATGTATCATATMTCTTTTTAATGTATTT 

TATAGATGTCTTATATMGTAAAMTAGAAAAGTTTACTGATTTATAATC 

CraTACTATOGCTTTCAGACGTATTm^TTGTTAAACTGGTAACACA 

TTTTATGTTTATAATTCACAATAAGCACTGcicACTGAAGGTGCCAAAGGC 

TCCCTAGAATCTCAGTAAGAACCTAGTGGGTAATATTTGAAGTTTrGGAT 

GCCAGTAAATTCATGTGTAAAGATTTATTGAGTAAGTGACTACCAGCGGG 

ACAGTGGTGGTGCACGCCTTTAGTCCCAGCACTTGGGAGGCAGAGGCAGG 

CGAATTTCTGAGTTCGAGGCCAGCCTGGTCTACAGAGTGAGTTCCCAGGA 

TMCCAGGGCTACACAGAGAMCCCTGTCACGCTGTCTCAAAAAAAAAAA 

MAAAAAAAAMGMTATACCATTTTTMGGCATTTGATCCACAAAATCA 

TACCACCTTGTTTTACAAMGATATATATTAACTTGAAGGCTGGAAATGG 

TGGCACATGTCTTTAGTCCCAGTATTGGGAAGACAGACCCAGATGGATCT 

CTGAGTTCAAGACCAGCATGGTCTACATAGTGAATTCCATGTAAGTTTGT 

CCGTGTGTGTAACTTGAAACCTCATTATAGAATGGAAGTGTCTACCCCAC 

CCCACTTACCAACAGTAAGGMTATTATGTTGGTCCCGCTCATTTAATAC 

FIG.3B(6) 
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ATGGTGTACTCCCAAGGTAAATCATTTTCATGTTTAGTCGCTCCTATTAT 

TTTTTCCATTATCAATTCACTACAACTACTACCACCAATCACATTTAGCC 

ACTAGAAAAGCCATGTGATTTGCTCCACACATACAACTTCACTCAATAAA 

TAAACATCTTATCAGTACTACTCTCTCTTTCAGTCACTCAATCCCTAGTC 

CCCTMGTTTnGGACGATTACACCAGGTAAATTCCTACTTCAGGGTTAT 

GACCATCTTAAAMCTACGACCTAGCMTTCTCTTTGTATAAGAAATACT 

TCCCCGTATATAGACAGAAAAACAAAGAACACTACTACAGCACTATTCAG 

ATG^CMCTGACTAAMGTCACCTMTTGCTTATTTATGGGAGTTGATTA 

AATTAGTCATTACAAATCTGTAGGTCTGCAAGACTAACCAAGAGCTTGGT 

GAGGACAATAGGTAGGGCTACCCAGAGAAACCCTGTCACCCTGTCTCGAA 

AAAAAAAAAAAAAMGGGAGGCACAGAGAAAAAACAACAGGCCGGGGGTA 

CCTGTACATCTATGTAAGCGTAGGTACATGCACATAAAAGTGACTACAAG 

AGMCATAMCAGAGAGCGCCGAT^GMGAGGATGiaGATTITrCATTTA 

ATTTGCGTGTATGAGAGCACCTATATGTGCATGTTATCCGCACCAAAGTG 

TGTAGGGTACATTATGTGAGTGTGCCTGCAGAGGTCACTGTCAGGTGTCT 

TCAATCACTC<XCTCC7TTTTCTTCTG<aAGATMGAGTTTCATOA^ 

TACTGGCTCGACTAGMCTCACTAT^AAACCA€GCTC<3CCTTGMTTCT 

CAGAGAGCCTCTTGAGTGCTGGAATTATATGCATGTGCGGCAACACAGCC 

CACCTCATTTTGG^GGGTAGGATCTTTCACTGAACCT^GCTCACT<^TT 

GGTTAGAC€G<^T^GCCAGTAAGTTCCAGGAGCTCTCTTCTCTCC^CCT 

CnCAGCACTGTWCACA^ 

FIG.3B(7) 
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GTCCTGGA6ATCTAAACTCAGCTCTCCATGCCTGTGCAGAAGGAATTAAA 

CTGAGCCAGCTGTCTCAGTATCAAGAGAGAACATAGGAACTGTAAGATTC 

TGACAGTACTCTAGGGCTTACAGAACACCGACACATnTCTACTATGTAT 

TCAGTTAATAAAAGAATAAATACAAACAAAAAAACATGAGAAACATATAG 

AGGCAGAGACAGACAGACACACACACACACACACACACACACGCACACAC 

ACACACACACACACACGCACTTAGACGGGTGTGGGGGAAGAAAGAGCAAG 

GCCACCTAGAAACAGGTACGTTCCATGCAAATGATCACAGGAAAGGATTG 

GGGA7TTTTAACCACTTGTGGGAAATGCTGTACTCTCCTATTCTAGCACA 

GATTTGAGGAAAAAGTAGACCAGAGAGTCTGTCCTTCCACATATCCTGGA 

MGTCACTGACATGTCCAAGTTTTGATTTCTTCATAGGGACAATGAGAGA 

AACCCAGACTATCTCACAGCAGCACAGCAAGGACCAACCAGCAGAGCAGG 

AGAAGTGCTTACAGCAGTGTGCTGCTAGAAGGTGCAACAGTCTTCTTACA 

GAGGGCAT7TAAATATGCAGGATGGATAAGTTTGCCAACTACAACTACAG 

AGGCTGGACAAGGTAGGACAGCTTCTTCACTGTCAAAGACGTTTGGGCAG 

TTGCTTCTATTTACCTTAAAATCAAACTGTGACAGCTGTGGCATATATAG 

AmCTCCCAGMTGAAAACACATTAACTCACTTATGTGAATAATATGGA 

GTAAACACAAACATAGTCTATCTAGCTCAGCATGCAAGACATGTGAGGAA 

GAGGAGCTACTGTGAGTCCCTATCCCTGTCCCTAAGGAAAGCAATATATG 

TAAATGTAGTCTAAGCTGCAGGCAGTTCTTCAACTGCGTACCCCAGGCTG 

CTCACCAC7TCACATTCTAAGCACAGACTAGAAAGTATCATCAACCTCTG 

MCACTGTGCTATMTGTTACCATCAATCTCACACAGAAATTTCATAACA 

FIG.3B(8) 
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TTTTAAGTMGTCTATGATGATTCTATCTTGTGTCCCAGTTATATAAGAT 

CCATAGGTCACAGGGTAGACATTCMGGACACCAA^ATTTCGAATTTTGG 

Gl I I I I I IGGTGTACTCTATATACTT-GCTAGTGCAGGTACCCATGCTCAT 

GTGTGTAGMGTTGGGCGTCTnCTTCTATCACTGTCTACTTTATATTTT 

CTTTATTGTTTCATTTGATATGTATAGGTGTTTTGCCT-GCATATATGTGT 

ATGTTTGTTGCCAGMGAGGGTATTGMTTCCCTCGGACTAGAGTrACAG 

GTGGTTGTGAGGCACCATTATGGGTACTGGGACTCAATCCTCGGTTCTCT 

GGAAGGGCAGCCAGTACTTTTMTCACT<3AGCCATCTCTTTA<3CTTCCTT 

CGTTCATTCGTTCGTTCA1TCCTTCATTCCTTCATTCCTTCA1TCAGAGG 

ATTGAGATACCTOCTCAGTTAGGCTGGCTAGCCAATGGACTCTCGGAAT 

CTATCTGTTCAGCTATTCTCTCCTTCCCCATCCAAGTGCT<3GGGATACAG 

GCAGGTCCTACTGGGTTCATTTTGAAAAATTACAGMCT^TGTATTTTCT 

TCATAAATCTGAAACTCAGCATAACTGTCTCAGGCTAACATCGAATGGCT 

AAATATATATGA^GCACAACCTGACTTTACCAACTGTACTATGTAAATTT 

GCTAGTATATTAGTCAACACTTAATGGAAAAACATCTGATAAAAACAACT 

TACAGGCCMTAG^AAGGA<^CACTTCGGGA€GTGGATTCAA6GCAGTC 

ACTGGATTCTT<^TTTAAGTCCAGCCTAGGCTACAT<aA<aATTCT^TTC 

AAAAMTAMCAATTAMmATGGG<3GAAA<^^ 

CAGAMTTCCATCCTATCATCCMG^GAGATATT^TATAACAGCGAAGTT 

CCTCAGCTCACA<3CA€TCAGTAGCATATAGACAATCCT<aGCTCCAAGCCT 

ATtaAAMCACAGCCTOTACTAAAGGT<aTC 
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GTGCCCCCTAAGTCTTGT6TATTTGAATACTTGGCACTCACTTGGTGGCG 



ATTTC 



rGGGAGGAATTAGGAGGTGTGGCCTTGGTGGAAAAGGAGCATCACTA 
GGGTCAAGGTTTCAAAATCCTCCTGCCATCATCCCCAATATGTCCTCTCT 
GCCTCCTGCTTGCAGTTCAAGCTATGAGCTCTTAGTTACTACTTCCACCA 
CCTACCCCTGCTATCTCTGCTCCATCATCATGGACTCCTATTCTGGTGGA 
ACTGTTAGTCCAAAAMGTCCTTTCTTCTACAACTTGATTTGATGCCAGA 
TCTAGCCCCCCAGCCTAGCTAGCAATATACCAAGGTATACCATCTTGAAC 
TCTAGGTGTCTCTCAATCCAATCAAGCTACATAAGATTAACCATCATACC 
TAGTCATCCCCAAATCAGTGTATCTCTCTCCTCCCAAGACTATAAGCTCC 
TCAAGGGTCAAAATATGTAGAAAGGAAGAAAGATTCTCAAAGGTCAAGGA 
TCAGACCTTGGTGAGGATTGAGCACTGTCTACACTTTGCCTGGTAAAGAA 
GGGTCCACMTGTAAMGAGMCTGACCTGMCAGTTTTCAATTAGGTGC 
TAACAAATGTCTCATACGTATTGAGTTTCTTATAAATAAATAAATAAATA 
AATAAATAAATAAGCAAGCAAGCAAGCAAGCACTTAAGAGCACTAGCTCC 
TTTCTTCCTGAAGACCTGGTTTCAATTACCCAGCACTTATACAGAGGCTC 
ATACCAATTGTAACTCCAGTTTGATGATATCCAACATCTTCTTCTAGCCT 
TCAGACACCAAGCACCAAGCATGTAATGGTATAACACATGTATACGAAAC 
ACCCATACAMCCMTTTTTAAAAAAATATTCGAGCCGGCGTGGTGGGGC 
ACGCCTTTAATCCCAGCACTCGGGAGACAGAGGCAGGTGGAT7TCTGAGT 
TCGAGGCCAGCGTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTGGA 
CAGAGAAACCCTGTCTCGAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAT 

FIG.3B(10) 

— «wc___ TO SUBSTITUTE SHEET (RULE 26) 



WO 00/05373 PCT/U599/1.6484 

23/173 

AGTCATTTTAGGGCTGGAGAGATGGCTCAGG6GTTAAGAGCACTGACTGT 

TCTTCCAGAGGTCCTTAGTTCAATATCCAGCAACCACATGGTGGCTCACA 

GCCATTTGTAATGGGGATCCAATATCCCATTCTGGTGTGTCTGAAGACAG 

CTATAGTGTAAATAAAATAAAGAAATCATATAAATAAAATAAATAAATCT 

TTTTAAAMTATTAATTAACCCAGGCTGAACCTAAACTTACAAACTTCCC 

ACATTAGGCTCTTTAATGCGGGTGTtATAGGTCTGAATACCAGCTTAAGA 

ATMTATTCTTCTGAAGAATGTGCCCTGGTCAATCACCATGACCACACCT 

GCCMCAGGTCCTTCATAAAATACTTCGTATATGTTGAATGTTCCATAAA 

ATTATGGAGCTAGAAAAGGTAGTGAGCTAGAAGGATATTAAAGATATAAA 

CCATTGCCCCAGTGGTCCTCACATTTGTCTAGTMTAGAACGTT-GTTAAA 

CTGTTTTTATTTAGAATTTC AATATATAAAAGAC AAATATGAAATAGTC C 

GGMGCAMTTMGCTACAGCTTCCAGCAAAGCCAGATAGAATGCAGATT 

AMCTMCACAGTACCTTTGTCTTATGTTrTAGATGCTAAAGTCTAGTCT 

ACMCCCCAGCTCCCCTTGMCTCTTAGCAGTCCTCTTGCTTCAGCCTCT 

CATGCTGCTAGGGTTAAMGTATCTCGGACCACACACAGTm^GTTT 

AGAGCACTTAMTGATCTATTCAGCMCTCAGGCAGGATTTACACTGAAA 

GTAMTTATCTTATGMTCCTTTTT^TTTTCCTTTTATTCAT^ 

ATGCACCTTACATGAACTATCTATTGCTAGGCTGTCTCTATACTGGATGC 

TCAGCACATCA€CMCATGCe<^TTCTTCTACTGGTACAATGGCAAT'GCT 

GAGAAAACCACACMCCTAA6ACAGTAG<3GAGGTGGTC^^ 

GTCTTCTOTTCTTK^ 
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AGCCCTGGCTGTCCTAGAACTCACTCTGTAGACC6GGCTGGCCTCAAACT 

CAGAAATCCGCCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGCGTGTG 

CCACCACGCCGGGCTCTGGTGCTCTGATTTTTAAATACAACAATTTTCAG 

CTAGCAATGTAACTCAGTAGTAAAATGCCTGCCCAGCATGCACAAGGCTC 

CAGACTGGACCCTGAGCACCACMCACTTT7TAAMGATGTGTTTATTTT 

A7TTTATGTGCATGAGTGTTTTGCTTACATGAATGTCTGCACTGTGTTTA 

CCTGGTGCCTGTGAAGGTTAGAAGGCAATGGAGCTATGGAGAGTTGTAAA 

CTACCATGTGGAAATGGAGCTATGGAGAGTTGTAAACTACCATGTGGGTA 

CTAGGAATTGAATCAGGGCACTCCTCTGCAAGAACAACAAAGGCTCTTAA 

CAGCTAAMTATTACTACAMCCCACACCACAAAATTTTAAATTGATAGA 

CATTATCACCTTAGTTCTAGATAGAGAATGTGCTTGGCATTGTAAGTACT 

AAAMGGTTTTGGGGTGGATCT7TTATATTATCTCACTATMTTTTATAA 

AATTMTACTCAMTATGTTATAAGTTMGGTTTTTATTTTTGTTTTTCA 

TTTCTGTATTTTGTCTATGTAGCTCTGCCTGGCCTGAAACTCATGGGAAC 

TTGACTGGGCTCAAACTCAGAGAGACCTGAACGGCCCTGCCTCCAAAGAG 

CTGGGACTAACCATCCCCAACAGTAGGTAGCTTTAATACCTAACCAGTGT 

ATTAGTTCATGCTCTCMTTMCCMCATTCTCTACATACAGAMTTTTT 

ATGCCTATTTAATGAAATACACAGTCTAAGTAAACTCTAAGTACAACTCC 

TTGGCTCATATTCTTACAATGGCTATGGCTAGCTAATTCAAAGGCCAGTC 

ACATAAAAGGGTCTCTATGAATTCTGATTAACAAATGCAGTrAAATAGAT 

GMTTCCTAAAAAGTAGTATCATAATMTATCATATTTAGTTTTTCTGCT 

FIG.3B(12) 
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TCCATTATAGTTTGAGGTGCCTCCTCCCATAATGCAAGGTATATTTCAAA 
TMTAGATATATACATGGTTAACACATGGCAAATGGCATTTTAAATGCTT 
AGCACAGCCTGCTCTTTGGCTCCATTAAGTGAAACTCTTAAGTTCTCAGT 
TAAAATAATTGTTGGAGAGCTATAGGAGCAATGGGTGGAGAACTAGTCTT 
CTMTTTGTCCTTTGCCTCCTTGCGTACTAAGTAGTGCCTCCCTCACTAT 
GTGGCATTCCAGCAGACTACCACCAAGAGAAGAACAGAAAAGTGTTGATT 
TCTTTCTAAAGTAAAGAAATAAGGGGCCAGTGAGATACCTCAGCAGGTCA 
AAGCCATTTGCCTAGAAACCAAAGTTCAATCCTTGGAAGCCCTGTAAAGG 
TGGMTTAGAAAACAGACTCCACAAAACTCTCCTCTAAGCTCCACTOGGG 
CACACATGTGCCAACCCCTCCATTCTCCCTCCCCCACATACAAAGTAACA 
ATAMCTTTCAGAAMTTTAAGTTGCTACGCATGGTGATTGATGAATGTC 
TTTMTTCTAGCTCTTGGGAAGCAGAAGTGGGTGGATCTCTCTCAGTTCA 
AGACCAACCTGGTCTATATAGTGTGTTCCAGGCATCCAGGACTACACACA 
CACACACAAMTTACGTGMGGMGTAGAATCTTT-GAAGGAAAGMGTCT 
GGAMTGGGGATCGAGAGAGACCTCAGCAATTAAGAAAAGGTCTTGCACC 
GGACGTGGTGGTCCATGCCTTTAATCCCAGCACTCGGGAGGCAGA6GCAG 
GCGGAmCTGAGmGAGGCCAGCCTGGTCTACAAAGTGAGTTCCAGGA 
CAGCCAGGGCTACACAGAGMACCCAGTCTGGAAAAAACCAAAACCAAAA 
ACAGAAMCCAGTATGATA^TCAGGCMTTGGATGGAGACAGGACACTC 
AAGATAGCTAt^CTGTGCMTATAGAAAGAAGTCTCATGGAAGAGAGAGG 
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6A6A6AGA6AGAGAATGAGAGCGAGAGAGCGAGCGCACCTCAGTTGATAC 

AAGATTGGGGCCCTGAGTTCCATCCCCAGCATCCCATAAATTGGGTGTAG 

CAGCACACACCTGTATCCCAGCAGAGAGGCAAAAGACAAGTTCAAAGTCC 

TATATGGAAAAAGTGTGAGATCAGCCTGGAGACCTGGTGTGTGGCAGTGG 

GGTGAGGGGTGTCATCAAGGAGAAGGCTTAGTAAGTAAAGGACCTGCGTT 

GGTTCTTGAGTTCAAGTCTCCAGCAATCAGAGAAAGCCAGAACCATTGCA 

CAMGnGTMGCCMGTGTTGGACTGGACAGAGACAGGCAAATGTTTGA 

GGTCCAGGTTCAGTAAGAGACCCTATCTCAAAAAATCTGATGGAGAGTAA 

CACTGGAAGAACTCAGAGTGAGTCACACATGCACACACAGGTGAATGTGT 

ATACAAAGGGGGCAGGGAGGGAGAATGA6AGGAGACTGGGAGATATCTGT 

AGTTCATGTCTGTAATTCTAGCAC7TCAGAGGCAGCTGGAGCTACACAGC 

AAGAGCCCGTCTCAAAAACAAACCCAAGCCTGACAGTGGTGAGGTACACC 

TTTAAGCCCAGAGGCAGGAGAATCTCTGAGTTCAAGGGCAGCCTGAGTGA 

GTTCCAGGACAACCAGGGCTCCACAAAGAAACACTGTCTTGAAAAAAACC 

AAAACCAACCAAACAAAAAAGAATCAAAAACAACCACCACCACTACAACA 

AAGCAAACAAGGGAGAAGGTATAAAATGCTTAGGAGAGTCTTCCTTTAGT 

CTCCATCCTTTGGGTACTCCTTCCCCACAGAAAGCCACTACTACCAATTT 

CTTACATAAGCTGCTGTTTTAGACACAGG I II 1 1 1 1 1 1 1 1 1 1 IA AATATA 

GTMCATATTCATGTGTAGCTCATTTTTCTAGTGAGTGGTTtaGTCCTTCT 
TTTMCAGTTTAMGGACCTCTATGTTTAAAGGCGATTGGCCCTTGTCTG 
GAGTATGGGTTGTATTTrCCCMTTTGTGAGTTTTACCCMCGTATTCCC 
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TATTACCTATGGCCATTTATTCTTGTC6ATAAGTAGTTTCCAATTGTATG 

ACTATGGTCACAGTGTTCCATGGACTCTTCTGCCGCTAGACAGCCCCTGG 

GTCTGMTTTGAGATGGTTACMGGGTGATTGGCTCTCCTCGCTGGGTGC 

TGGGATTAMGGCGTGCACCTCCACACCCMTTTIaTTCTGTTTTGTAAGA 

MTGAGGTTTTATTGTGTTGCTCAGGCTGATCTCAGTCTCCTGGCCTCAA 

GGTATCCTCCCATGTCGATACACAGCACAAGGC<3TAGGAAAAGTGGCAGA 

TTTTTTTAMTTMGTTTT€mCCAAMTATA^TTCAGAAATGTGAGA 

TrTTCACAMGTC^CCTGCTCACTTCCCTGGCTCTMGAATCTCCATTGT 

GGCTCCCGCCCATCCCTTTTGCCCACCAGTGGCTGTTGTATTGACTTCTA 

TCGCATTCCTTMCTATACCTCTCCTT(^TCTTCGCT€TGAACTTGCTTG 

GGCTGAGMTCACCTTGTTCCGGGCACATCAGGTCAGTGAGGGTGTTTCC 

AGAGAGTTTTAACAGAGACCAGMGACCCACTCCAAATGTGGGTGGCAAT 

ACCT^TGTTCTCTCATGCT<3GACT<^TAGXaAAGAGGAAAGTAAGAAGC 

AAACGGCACCCCCACCTCTCTGTCTGCTTCCTCGCGGACACAAAGTGACC 

AGGGCCTCCCACTCCTGCCCCCTCAGCTAGAGACACTTGCTCCCATCTTT 

CCAACCACTCTGAGACTGTGCCTACTAACGGTGACCCAAAATAAATGTTT 

CCTTCCTTMGGTT^CTTTGTTAGCTCCTTTAATAGAGGGGTAGGACAT 

GTMCTCCCACAGGCAGCCATGGGTGCCAGCCCCTCCCACTGACGGTCTG 

AGAACCACACTCA^TCTAG^ACAGCTCTCATAGCTCT^G^GCGTAGC 

TCTGTCTACTCK&TCAWCCCCT^^ 

GCTG^T^TCGGTAGCACT^TTATGMCATCCTAGTACAAATCTCAGGG 

FIG.3B(15) 
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TGACACGCGCCTTCATTTTTCCTGAGAAAATGGCCAAGGATAAAATGCTA 
GGGCCAAGGGAAGAATATTTCACCATTAAGAGACACTGGTCAGGACTGGA 
AAGATGGCCCAGTGGTTAAGAGCACTGACTACTCTTCCAGAGGTCCTGAG 
TTCAATTCTCAGCAACCACATGGTGGCTCACAAGCATCTGTAATGGGATC 
CAATGTCCTCTTCTGGTGTGTCTGAAGACAGTGACAGTGTACCTACATAC 
ATGAAATAAATAAATAAATATCTGAGAGAGACAGACAGACAGACACTGGC 
TAGTCATCTCACAATGTTCTCATGTTTAAAATATGATACCATTTGTATAA 
AGCAGAAACACAGGAAAMTAAAATCTGTGGTATTATATTTGATTTTTAA 
ATTAACTTGATTAGTGAAG7TAGCAGCTACACTGGGCAGGGGTTGGGAGT 
GGGGTACTCTGMGTGCTGGTATTTCTGGTTTTGTTTTTTGTTTGTTTGT 
1 1 1 1 1 lATCTTATTTATATTACATAGAAAGCCATTTTGCTAATACACTTA 
CCATGTGTATATATTGTGCTTGAA7TACAGGTAAGTAATTATTTCTGAGG 
GGCTTTAGACTACTGAAGATTGGGCCCAATGAGCCCCACCCCAAGTAGTC 
TCCAACATCCCTCTTGGAAGTACTTGAGAGCAAAGATTCAAGTCACATGT 
CCCCAAACCCTCAGCAGCCACCACCCTTTAGGTGTGGCTTTTGCTCTCGG 
TCATCCTGGMCATCTTGCCATCTTTGGTTTGTTCTCTCCCTGTCTTGCC 
TCTGGTAGAGCTGGGTTTCTGTGCTTCTATTCAACCATGTACAAGAACCA 
TGTGCCACCTGCCATGTGCCAAGCCTGTGCCAGTCCCTGTGAGCGAGCAG 
CCCACCCCGTGAGTTATCATGTGAGGAGCTATGAGGAGCAGGAAGGGGCG 
CGGATGACTTCAGCAGACAGTATGAAGCAAGCACTGTGCGATTTATGCTC 
CCTGGCCACATGCCCACAGATGGTGTCTGAGACACTAGCGTTTAATATTT 
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6AATTCTCCACATTCTAGCCTA6ACATTPPGGTTGCAAGAAGAAAATTGA 
CTCCAGTTGTATCCTGGMTGAAATTTATT<3GA€GAAAATACTCGACAGG 
CTCCCAGAGAAAATACGATATTCAGGCACAAAAAGAAATGGGGACTGAGG 
ATCTGMGTTCAAGGTCATCTGTAATGAGATTGAAGTCAGTTTGGGCTAC 
ATGGGACCTGGTCTAGGGGGAATGGGGAAGAGAAGGGAAGGGATCGAGAT 
AGGGAT 
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CAATGTGCTCTGACGATTAATGGGCTAGAAATGTGTGGCTGTTGATTAGT 

GAAAAGATGTCATGGTTCAGGAGATTGGTAGTCTCTGTGGGAAGACAACT 

CACTGAAAGGGAGGAAATAGCCTGGAAGAGATAAAGAGACAGTGATCAGC 

TAGGMGCTTAAAATTTAAATTTTGTTGGAAGTACTGTTAGGAATACTAG 

CAGAGGCCAGATGAATGTATGGTTAAGTTATAGCAAAGGAAAAGATTGTT 

AATGGTGAGGTTAGGAATGCAGGGTGACACCAACCTGTAATGTCAGCATT 

AGCGAGATAGAAGCAGGTGTTTAAGGCCATTCTCTGCTACTTAGCAAGTT 

GAGGCCMTCTGGACCACATGAGACCTTTTTTCAAAAATAAATCTCCTTA 

MCAAMGAGGCTGGGTTTTTTGATAGATTCTTCAAGATGTTAATGTAAA 

TAAATGGAAGACCAAGGATGGCATGCTAATATCCTCAGTGTCTGAAGAAG 

GACTATGTAGTGTTGGCTGCTGACTCTGAAGTAAGTGCTCATTACTGACA 

GATAGTGTATCTTAGAGCCTGGCAGATGGGATGGAAGTGAGGAAGCAAGT 

AGCACCTTTGTATATTATGTTCTAAGTAGCCAGAGATACTTGACACAAAA 

CAAAGTTGAGAAAATGTATCTTCTAGAAAATACAGAGATGGAAAGGTGTC 

CTTTCTATAAAAGAGGTATTAAACATTAACCTGAAAAAAAAGTTAGCAAA 

TTGGGCTTTGGCAMTGMtATAGTCMGTTTCATTTTTATTTTGTTTTT 

TGTATATGACTGTTTGGCTTGTTGTACCATGTGTGTTCCTGGTGCCTAGG 

AAGTCAGCTGGAGTTACAGATGGTGTGAGTTGCCATGTCGGTGCTGAGAG 

ATGAACCTAGGTCCTCTGGAAGAGCAGTTAGTGCTCTTAACCACTGAGCC 

ATCTCTCTAGTTCCTTCTGTAGMTTTTCATTMTTTACAAAGGAGAAAG 

TATAAATGATAAAACCATGAGAAGATAGACCGGCACTAGAATTAGTGGAG 

FIG.3C(1) 
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TCAAAATGTTAAT6ATATGTCAGATAGGCCTTATATGAG6AAGTTGCAAA 

ATTATGAAAATCCAGGCACTCCACTGAGTTAGAAATCTAGGCTCTGATGC 

ATACTGCTATGGTAAGGTAGCAAGTGGCCATTGAGTGCAGAAGTGAGTCT 

GGATGGGTCTTCTCGTGTTGTGGAGCACACAGACTGCTGTCTTCTGCATT 

GCAGTTTCACCTGTATTTCCTTCGMCTACTTAGCTTTt^MCTAGGGCT 

TAAAAAAMCTTTATATTTATGGTTTTMGTTATTO 

ATTTTATGAGACATAGTCTCACTCTCTAACCTAGGCTGGCCTGGAACTCC 

CTAGGTMCTTGAGCTGGTGATTCTCTTGCCATA^CCTTCTAAAATTTTA 

GATT<K)AGGCATMGCCAGACCACTCCT<3ACTTTTCTAGCCATTTTTCTG 

ACATGMGTGTMCTTTGCTTTCATMCTAAMTGATTTAGTTGTTTTGT 

TATTGTTTMTCCCTTTTGCTnGMTGTAT^^ 

ATATATAACCACAGACTTTTCCACAGGCATCCTACCCTAGGTCCAGAAAT 

GACTCTGA6ACGTCTTATATATGMTGAATGCCTAGGCCAATAGCTTTGG 

CTGATTTCCAC^GTOATAGCTCAGmTGCCATTTAAACTAGTCTAAG 

TCATGCCATGAGGCTACATACCCCTCCTTCAGmCAGGCGACTGTCTTC 

TCAGTTGTGTAATGT-CCTATCCTCTGTTGCT'GCTCCGCAACCCCCATCCT 

TGCGTCATAGTCCGTCTGTCCTCGTCTCCGCCCATTTACTTGCACAAGGG 

ACTCTACTCTAGMGTCCTCTCTCTGCTKBGAGCTTGCACCTCGGCTCTCC 

CCGTCTAAGCTMTAGGCMCAGCATTCTACA<^CA€GTGAT€CTTCCAT 

ACATCGCACAGGAGAmTCC€TACACAWACTOTOATCCA€C<3TGA 

ATCCMCCGTCCAGGtXafGTTCTrc^ 
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AGTCTGATGMTTTCTTTGTCTTTACAACCAAGAAAGATAATACTGTAAG 

AMTTTTGACTMCATT7TTTCTTTATTTAAATTACAGACTAACTGGCTC 

TTCTGGATTTGTAACAGATGGACCTGGGAATTATAAATATAAGACGAAGT 

GCACATGGCTCATTGMGGACAGTMGTTATAATGGCTGACTTTATTTTA 

ATTTATrATAAGAGCACAGTATAGCACAAAATACTTCCATGTGTGnATT 

GCTATTTCTTGAGACAGGACCTTTCTGACTGAGTAACTCAGGCTGACCTT 

(3MTTTTGCTATGCTACCTCTGCTTCCCAAGTGCTAGGGTGGTAGGTGTG 

GACCACCATGCCCTGCTGCTAAMTACCGTTCATTGATGCTTTTCATTTG 

GATAGTGtTCTTGCTTTTTAAMTTTACTTTTTGGGGGACCAGAGAGATG 

GCTCAGTGGGTAAAGTGCTTGCTGAACAAGTCTGGTTATGTGAGTTAATC 

CCTGGCTCCCACAGTGGAAGAGTGACTCCTGAAAGTTGTCTTCTGACTCC 

CACGC7TGTGCATGCACGCACACACACAAATAATAAAATAAAAAA7TAAA 

AGGAAATTTTC I I I 1 1 TGGGTGATAGGGATTGAACCTATGACTTCACTAA 

GCMGTGGTCTATTGTTAAATAATTCCTTTAATTTGTGG G 1 1 1 1 1 1 1 1 1 | 

TTAGGTTCCAAGTTGACTTMTGTTATAAATGAMGATACATACCAGAAA 

TTTGCATATTTCTMTAGTTTAAAAMCTTAGTTAAMTCTTTTTAATAG 

TTTGCTTAAATCTTTATATMTMTCCTATTATATCATTTTTCTAAATAT 

TGATTTTATTATCAGCAAAACAGTAAATGAGCCATCAGAATAACCACTGT 

AGCCTGTTTCCCTGGCCCTCTGTCCTTCCATCTGTCTATCTTCTCTTTTT 

mcCTmTTGTGCCTGTCAmAG{MC/^GCATmAGTCTCTGAAC 

AAAACTTTGAMTTTCCMGTMCTCTT<?TTTATTTCTTGTGTCT 
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tc aac c c aagaaatattatttac taac tc atttaaaagc aac aattataa 
cccactacatgttagca6aaamcctatttgtttttatt6a6aggg6atc 
acactagtaagcactacatggcatggggttcactgtgtagatcaggcagg 
ctggcttcgtgctcttgacagtcctcctgtgtttgtctctcacttctgag 
tgctggga™tagaca™ccmcacaccgattogggggttggggtac 
tgggatcagtccagagttgcatggatgctaggcaagcactccagcaactt 
agctatatccctggtcatamtgtcatmggaaaaamttcgtotattt 

AAAGAMTmMGMTTGCATT<3mAA^mCACAGATCTCm<3CT 
ATCTGGCAATC 1 1 I I I I GATATTTTCTTTTCTTTTTAAAAATAT'GTGGTA 

tgtaaacamcttaaatatgaatgggacagttccagatgagagtgaaaag 
ttaaatatttgggagaaaaattgataggtttatctattatggaaaatttc 
agagattttagtaaaatttgaaaatggagctgggaggtctgaggtagtca 

TCTAMGCTGCCAGTTGTAGAGCGTGTT^GTCTCGAGTCAGAGGGAGT 
TACTGATACACTTGTTGAMTT<XCCAGGCmAT€GGAAGT^TGA^ 
GCTGTOCTGTGACTCTGG^CA€GGCTTGTOGTTTCCT7TGGATTTAGT 

ctcagtcagagttgatacatagtttcctgaggac€t^ctttttggtaca 
gtgctgtgaamggcagagaagcaggtaaac™<w^tgt<3t«ttttt 
amgtgatgtg™tgamtc™cgtaagatgaataaagaaagaagt<bg 

GGACACTGAGGGCTCCTGTTTCTAAATCTTAAAAGCAA^GCTGGAAACAT 

TCTTTGAAGGCCCCTGMGTCAGAGCCC^^ 

CATTTTT<yVTATTCCCTTACACA^ 

FIG.3C(4) 
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MTGCAGGAMGCTGTTTATATTTATATATATTTATATTGTATATTTTTC 

TCCTTATAMTTCTTTAAAAGTCTGTTTTAGTAGTTAATGTTATGATTAT 

TATAMTTAC7TM7TATTTTTCTAGGCCAAATAGAATAATGAGACTTCG 

CTTCMCCATTTTGCTACAGMTGTAGCTGGGACCATTTATATGTTTATG 

ATGGGGACTCAATCTAGGCACCTCTGATTGCTGCCTTTAGGTAAGCCCTG 

CTGCATTTCATCTCAGGAAGTAAGTGTGTCTCCAGGAtGGAGTCCGTGCT 

GCATTTACTTTATTCTGCAGTCACACTCATCTCATGGAATTAGTTCTGTT 

CTGGTGAGCTACAGTTCACTTGGTTTTTATGTACTGGGTCGTTTTCCATG 

TATACTAGTATGTAGCCACGGTTAGTCTTGAACTTCTGGTTCTCCTGCCT 

CCACCTTCCAAGTGCTAGGAGTATAGGCTTGTGCCACTGTGCCTGACTCA 

TTTCACATTCTTGAACTGTGAAGTTTTGATAACACTATTAAATTTACCTG 

CTATTTGTGATTTTGTTAMG7TTGCATTAAAMGTTTTTGACTATATTG 

ATMTAT7TTTGTGACAMTTTAAATCAGAAACCATACCTTTCTTGTTCT 

TGTATGTATTTGATTCGATAGGCCCTTAGGMTAACTTTTTTCAATAGTA 

TATAGTTCTCTCAGTTTGTATATATGTATTATTAGGGATAGGAGGAGCTT 

TCTGGAAGACTATTTATAAATTGGACAATGGCTAGCTGTTGAGAGTGAGG 

MTTTGCTAGTTTTGTTTTGTAAATCCCTCCCCAATGCATCTGTATTAGT 

GATTTAATAAMTMTGCM7TTTGTCAGTTATATGGGTTGCACTGAATT 

TTTGCTATTTTATTTTMGAMGATTTTTGTGTGTCTACAGTGTATATG^ 

GTGTATGATATGTGTGCGTGTGCATGTGTGTGTGTACTTCTATGCAGGTA 

CTCACATCCTAT^TGTGCACGTAAGGTTGGAGTGCAGCCTCACATGTTG 
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ATCATTATATTCCACCTTGTTAGAGATAGGTTGTCTTTGTTGTTFGCT'GC 
GGCCTGGAGCTGGAGCT66AGCTAAC<aAGTCTCAGCCACCTGACATGGGT 
ACTGGGAACCAAGAGCAGCAAGACCTCTTCTT CTTCTTCTTC 1 1 \ I 1 1CA 
TmCGGTTmCMGACAGGGmCTCTGTATAGCCCT<ffiCTGTCCTGG 
AACTCACTCTGTAGACCAGGCTGGCCTCGAACTCAGAAATCTGCCT'GCCT 
CTGCCTCCCAAATGCTGGGATTAAAGGTGTGTGCCACCACCACCCAGCCT 
AAMGATTTTCTTACTAAMTATATTTCTAAATTAATTAGTTGGAATCTG 
GTTCATACTTCTTTTTGAAACAAAACCAGCA 1 1 1 1 1 1 MCATTCTACATA 
CAGAGACATTGACACTAGACACTGGTTATGAGTAGTTACTATAAGAATGG 
GAMTTATTCCACCCTTGTAAAACTTAATACAACTCCTTATCAGGCTCTG 
MGACTTTTTAAMGCMGAATTGTATATMCACACAGAMT^TTTAGA 
CTATTTAGATCTTTATTGCATGGGATTTTAAMTTATTATfGTATTTGGT 
GGGCATGTTTTGTCTATGTAGCATATCTCCCTCTAGAGGCAAtCACCAAG 
TAGGTCCTGGGMTCAAACCTGGTACCCCTGCTCTTAGGTGTfCTTAACT 
GCTGAGCCATCTCTCCAGTCCTC 

FIG.3C(6) 
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AGGCAA6AAAGAGCCAGCGAGCCTCCAGACAGACCATTAGAAATTCCACA 

GTCAGCACAATAGGGAGAACAGTAAATCTTACATTAAAAGAAGGCCAGGG 

CCTGGTAGCAAAAGGTTTTAATTTAAGCACTTGAGAGGGAGAGGAGGCAA 

ATCTCTCTGATTGGGGGTTGGGGTTAATGGTGAATGCCATGACACCCTCC 

TCAGAGTTAGCCTTCTCCCCTAAAAAMTTTTAAATTCATTTTCAATGCT 

GACACAGTTAATCATAGACATTGTATCTCAGACACCTCAACATACTCCAG 

ACTGCAGCACCAGCCCACTGCTGAGGCTGTCGTTCAGTTGGTAGAAGGCA 

TGCTCAGCATTCGCGAAGCACCAGACTTCATCCTTAGCACTACATAAAAC 

TGGGTGTGGTCATGCACACTTATAACTTCAGCACCATGGAGGCAGAGGCA 

GGATGATGAGAACTTGAGGATCATTCTCAGTTACATAGGGAGTTTGAGGT 

TAAGCAGGGGTACAGGAGGCCTGTCTCAAACAAACAGACAAACAGACAAA 

CAAACAAACTTCAAAAAACTCTTGAAGTACTAGGCCTAGTACGTGCTGAG 

ATTGTAGGTATATGTCATCATGCCTGTTGTAGAATGAGTGAGAGCGGACT 

CCATAGGCTTATAGATTTGAATCTTGGTGTCTGTCTATGTCATGTCATCC 

CTGCACAAAAGCCCACACTAGGCCCACACATTCTCTCTGTCTGCTGCATG 

TGGGTTAGATGTGAGCTCTCAGCTGCTGCTCCAGTGCCATGCCTGCCTGC 

TGCCAGGCTCCAGCCATGACGGTCAGGGACTAACTCTCTGAAACTGTAAC 

CMGTTCCCMTGAAATGCTTTCT7TCATAAG7TGCCTTGGTCATGGTGT 

CTGCTTCACAGCAATAACACGGTGACTAAGATACCTGGCTCCTGCCCTCC 

CCACCCCACCATTATTTACCATAAAGTAAACAATACACAGTTGGATAACA 

TGATACTGMGTTATnTCCTGTTTCCTGATGTAACCCAATTTTGGACAA 

FIG.3D(1) 
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6ATTAAGCCTTAAATAGCAAGCTGTGA6GCAGGATAAAGAAAAA6CT0GC 
AGGCCMTGTCTGCTTTACCAMTTCTCTTCAGCAGTCTAAAGCTGCGGT 
CACCTCGACTCCTGTGATGGCATTTCCATCACTATCTTAGATATTCCCTG 
GGTCACMCCTTTTAGTACACAGATT<^MCTCTGATGGAAT^GCTGACT 
GCTTGGCTMTTAMGCMGCTAGAGTTTGTCTGGCTTCCTTGTCTGAAT 
GGGGAGGTGGTATTTACAAAATTTTGTAAATAAACTACTATATTTGCATG 
ATGTATATAMm^TGTCGCT^TTTTAAATCATTTAACGTAAACTGT 
CCCACAGMTCATCTGTTTGATTGGAAAGATTGTAGCTTCAAGAGAATTT 
CTGCTGAACCTGAAATGATrCATAATGATCTGTCTGAAGAATCTGTCCTA 
TCACCTAG^TTTTTCTmAGTOATAm^TACmMGAmCCm 
ATGTATGTGTGTGT€Tm<3TATTTATGTGAATGTATACCTCATCTATGTG 
GTCCTCMGGACACCTGAAGMGGGCTCTGtaAGCTGGAGTTACAGGGAGT 
TGT<^TGCTAGGAMGAAAGCTGGGTACACTGGGAMTCAAAAGGT'GCT 
TCTMCCACTGAGAMTCCTG<X;AGGGCCTTGGTHATTAAAAATAT€AA 
ACAAAACCAACACTAGTTACATAAGTATCTCTCTCTCTTTCTTTCTCTCT 
TTCTTTCTCTCTCTCTTTCTCTCTCTCTCTCTCTGTCACACACACACACA 
CACACACACACACACACACACACACAAAGGATCCATAATAGTTCTTCTGT 
ATC<IC^GTTAMTATMGTTCTTAGGGGCTAGAGAGATGGCTCAGCAGTT 
AAGAGTGCTTGTTGTTCTTCCAGAGGACCCAAGTTCAGATCCTAGTACAC 
ACATCAGGCAGCTCACAGCTACCCATATCTCCAGCTCCtAGGAAGAAGCAA 
TCMTGCCTATGtaCCCATGCAAGCACCAGCACACATATGCTCCACAAACA 
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TCCATATATATAGCTAAAAGTAATAAAAATAAATCTTCAAAAAATTAATT 

CTGGTTGMCTGAAAAAGATCACCTAACATTTAGAAAAAGCAGTTTACTA 

GTGAATAGGACATAAATCATGGTATCAAATATTCTGTTGTTAAAGGAAGC 

AACTAGAAAAAGCATGTGTTTGAAATAACCAATGGATACAAAACAAATGA 

GGCAACCCCAACATCTGTCAGTACCTTGCAAACCAACACAATAAATTTGA 

TTTTATTTAAATCGTAGTTAT7TTTCATGCTAGTAGTTTTGAAACACAAT 

AMTTTGATTTTATTTAMTCGTAGTTATTTTTCATGCTAGTAGTTTTGA 

MCCMGATCTAGATTTTGTATAGCCAeATAAATACACATTAGAATTGCA 

AACTGATACGAGCTTCATCTTCATCAGTCTCTCTTCATGAAAAGCAGTTA 

CAGGGACTGAGACATGACTCAGCAGTTAGGGCATGGGCTGTTCTTCCATA 

CGACATGGATTCAATTCTCAGTGCCCAAATGTTGGCTCACAACCATTTGT 

AACTCTGGTCCCAGGGGATCTGACACTCTTCTTGGCTTCTATGGCCACTG 

TATTCATACGGTACACAGACACATATGCAGGCAAAACTCAACAAAAAAAA 

TMGGTTTAAAAAAAAGAATTAGAACTTAAAGGCACTTCATTCCGTCAGC 

ACTAAATCAGCCTCTCTGGAGTCTTCCCACTTCATGAGAAAATCGTCAGC 

TCTCCACTGCTGTCTGTGGCTGAGGAGCAGGACCTGGACAACGTTCAGAG 

ATTGTCAGTGCATCTCTmCTTCTTTGGTTTGCTGTCATCAGGTTCACT 

GTCACATTCCCTTTGTACCATCCTTCCTTTMCAGCCTTTTGAAAATGCA 

GAAATGTTGGATGCT(3CCTTCAGTTCACACAGGCTGTCTT7TrAGCTCCT 

CATCTATCTATGCTTAATTTGTTAGTGGTGCTCACCCATGTATGTGTTTA 

TGTCATGAAGCCACAAGATGAGCCTTGATTGAGTCTTGCTGTCAGTGTGG 

FIG.3D(3) 

SUBSTITUTE SHEET (RULE 26) 



WO 00/05373 PCT/US99/J6484 

39/173 

atcacagamtgacaccctatcatctttccttcctgcttcttagaagtca 
ttgattctgcttatactcaaggcccacagtattatacttgggtgtgaacc 
ccaggaagcagggaggtggggggtgtcatggatactactcagatatctga 
ctgttgtgatatttcatcagttctcattggtcctatctttaaaatctgcc 
ctacatctagagctggctgtggtggtgtgtgtggtggcatcagtatcaga 
acttggattacagaggcaggmgattgtgatttttga€gccagaataggt 
gcatacaaagatcctgtctgcaaaagaaacaaatgtgcaaataattataa 
ctactttactaatagcctaactaataaccactgctagtgctgtgtccacg 
aaaaggtgaagtaaactgtgaaaatgacttccccttctgtgtgacacacg 
4ccgtcatgtgattttacttgtgtctcatcatt<aittitccttctgttt-gc 
atgtgtgaatgttcacatgtggaagccagaagtcagtgttgagtgtcttc 
atmttgatctctattctctttgttttgagacagggttttgagactaagc 
ccagtgctcagtgattcatccagtaaactgtagggagcttcctgtctctg 
cctccacagtgttgggattacaagcatgatccaaa™tgtgacaagg<3c 
tttactmcttagccatgtcctcagctccccactccccttttcttttctt 
cttc 1 1 1 1 1 1 1 1 a gacttactretttatttttatgaatgtcttgcctgca 
tgcatacacacacacacacacacacacacacacacacacaccccacatjqc 
mgcmttccagaagagggcatt<^tccctgaaactcgagttccagtta 
actgtgagcctgtcatgtgcgtactcggagctaaatccgggttctct^gga 
aggtcagcaa^tctocctcggagccgtctcmagctcatctcmct 
ctctt€aagcaagaaacctaggaatcatttoamcitcot 

FIG.3D(4) 
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TTATCATMCTTCACGTCMTTTTTACCTACTCTTTCAACAAATACATGT 

TATATTTACTTATTT7TATGTTTAGCCTGCTATTGGTTTCTACTTAGCCT 

CTTGCAGTAGAGTTCTGTCAGATTTATGTTTCTA7TGCTTTTMTTTATT 

TGTAMGGTGMTGGGAAMTATTTAAAAATTACAGATCCCATCATTTAC 

TATATTCTTAAAAGCCATGGCTAGCCAGGCTTGGTTGTGCATGCTTGTAC 

TCCCAGGACTCTGACAACTCAGTAAGGAGGAGAGTGAATCAGAAAATAGC 

GCCAGCCTGTGCTGCTTAGCAAGAAACAGAAACAAGTACAATCACACACA 

TAGAAAATCCCCCATTAATACCATCCCATTAGATATAATGGTCCTGTATG 

ACCATTCAACCACTGTTTGTCCTCTGTACTGCAGTAACAGTCTTCTGCCC 

TTGCCCGTGAAGCACGTGCGCACCCCGCCTCCAAGTGCTTTTGCACTGGT 

GTCCTCCGTCTAGATGTCCTGTTACTATATGTAAGGACTGGTTTCTCCTC 

CTCTTTACAGTTCAATCTAATTGTCTCATGAAAAGATCTTTCCTGACCAT 

CTGGTTCAGACAGGTTCTCCCTGTTGTTGTTTTGTTTTTTGTTTTATAGT 

TCTAMTTCCTTTCAGGMCTTTTGCTTATTTTAAATTCCCTGAGTGCAT 

ACGTGTGCTTGTTGTTGCTCATGCTCGTTGTTTGGGCTTACTTTACTATC 

AGCTCTGGATGTGGTTCACAGAAGGTGCTCAGGGGAGCACTCTCAGCCAC 

TCATCTCACACGGGTTATAGATATATGTATTGATGCTACGTTTGCTTGTG 

AGCCATGTTTTAMGATTAGAATATCTTTTCTATGTGTACTCTATCAAAA 

CACATGTTAGGGCTTTATCTATTTTATACAGATATtGGTGTTCTTGCTTT 

ACTMTTTTCATGGAATTTCGGTGMTATTAGTATTTTAGATAGGAAGAC 

TTGTCTCAAMTGTAGCTCAGCTGGTTGAGTGCCTGCCTGCATCTAGAAA 

FIG.3D(5) 
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6CCCTGTATTCACTCTCCAGCACCTCAGAAGTGGGCCATGGTGCATATGC 
TGTCATCTCAGCACTCCtBGAGGGAGAGAAMGAGAATCT-GGAGTTCAAGG 
TTATCCTTGGCTATATAACAAGTCCAAGATCAGCCTGGGCTACATGGCAT 
CCTGCCTCAAAATCAAACACCAAATCAAAAAGCTCACATCTT'GATCCAAA 
AGMGGTAGAGAGMTACACTCGGAMGTCmGAAACCTjCAAAGCTAAC 

tccaagtgacagtgacacctccttagcagggccataaattctaatccttc 

cccamgcccaccmctggagaccaagtattcaaagataagaatctatgc 

agtccattctccttcaaactaccacagtaggttttcttaaaaaaagaaaa 

mgmtattttmttgattgtgattattcagtattattcat-gaataatca 

tgaactacatggcaggactataaactatta 1 1 1 mil iaaagatttattt 

amatmatctatgtcagtacactgtagctgtcttcagacacaccaga 

agagagcatcaaatcccattacagatggttgtgagccaccaagt-ggttgc 

tgggmttgmctcaggacctctggaagaacagtcagttctcttaaccac 

tmgccatctctccagcccctataaactatta™tatttataaaatata 

aatccgtgagtctgtccacccctgtgtccacatcgatcggacatcttraa 

actggatotatcatacttagmgmtacaagatactctgtttr3tcatt 

tgggtgaamtatggtctcmatttocaggtatgacct^cm 

gaatg^cttccactaaaccattctgt<^cagtgt<3gtt^tmgata , fgg 

tcattctttg^cattacataaggtaaactatctcaactcttcaccaagca 

agaagttcmctcttcctgtt&tttat^^ 

t<^ttttagtt<sgtataaigctttgttttgat^tc 
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CACCAAGTTGTCACCAAGTTGTAATTGGAAATTGAAGTTAGAACGA7TTT 

AATCCATGGTGTCTTGCATTTGGATACTCTGATCACAGTTAACAATGAAG 

ATTAAATAGTGTCAGCAAGCCTATGCCCATTATCAAGTCTAGCATACTGC 

ATGCGTGTGACTGAGTAGCCATTGTTATCTCCTTGTTTTGAGCGTATATT 

GTAGMTGAGGCMCTGTATTTTCCACACCATTTTCGTTCTGTAACACGT 

TTCATGTAGAGMGGTGATTTAGAGAGGGGAAGAATGTGATTGTATTGGT 

TGGTTCTTTCTCTATGCTATTCCTAGCAAGTCACCGAAGAGCTCATGTTA 

CTCACACTTCTTAAGCTGGGATCACAATGAGATTGTGAACCACTCATTGT 

TGTTTTCCMTATMTTTTTAAAMGATGTATTTATTTTTATT^ 

TGTGGGTGTT7TGCCTGCATGTATGCCTGTGTATACTGTTCCTCCAGAGG 

TCAGAAGAGGATGGCATCAGAACTGGTGGCTGTTAGCTGCCATGTGGGTA 

CTAGGAACTAAACCCGGGTCCTCTGCAAGAGCAGCAAGTGTTCATAAACT 

CTCTCTCCAGCCCTAGAGTTGATTTGTTMTGGTTTTAAAMTCCTGTTT 

ACATCTTTCTTATAGGATAAAATCTACATGTATGGAGGAAAAATTGATTC 

AACAGGGAACGTGACCAATGAGCTGAGAGTATTTCATATTCATAATGAAT 

CATGGGTATTGTTAACTCCGAAAGCTAAGGATCAGTATGCAGTGGTTGGA 

CACTCAGCACACATTGTTACACTGGCATCTGGCCGTGTGGTCATGTTGGT 

CATCTrCGGTCATTGCCCACTCTATGGATATATAAGCGTTGTGCAGGAAT 

ATGACTTGGGTATGTATTTTTTCCAGTGGAGGCATCTTGAATATCATACT 

GAGAACCCCTGCCCTTATTATTAGGACACCGTAACAAAATTCAGCATGAT 

CTTGATCCAGTACCTTGTC7TGAAATAGTATCAGTAGATAACTCGTGAGA 

FIG.3D(7) 
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TTGAGGTTGTT6AAGTCCCT6TGCAACAGCTGTTTCTTACTTGTCAAGGT 

CTAGTCTTGGCTTGGGAGGGGTTCTGAGGAAAGGGGTGTCAAAAAACCCA 

AAMGTCGMTTGTAGGTCCAAGCTGGCAGCTGTATATT-GCATTAAGGAA 

AGCTGAGGGAMTTTGGGATATTTATTTCATCTATTAGTCTACATCAAGC 

AAGTCAAGCGCTCACAGTCAACGTTTGCACCCTCAAATTAGTAACAAAAG 

AGGGGGAACTGAGGAGTCCAGCATCGTCCTGGTTGGGACAGAATGACATG 

GTTCCAGCCCTGAGACAGGGGCAGCAGGT-CGGGGCCTCCATGGATGTCAC 

ACTATGGACATAAACCTGTTTGTATAATAATGTACATATTTCATGCTCCT 

CTTCTGAGTMTGTCCTTCTGTTAATGTGAATGACTTCATGATAATCAGA 

GCCAGTGTGAGTCTGGGAAGTAAATGGTGGGACCTTCAGGACAGCTCTTA 

AGGCTGTGGAAAAGAACATGAGTTCAAAACCATATACTTCCTCAACTATA 

CAAAAATAGAAGGATGCAATATGAATTGTATGAGGGGCTTCACAGATCTA 

MGGMCAAAAGCAGCTTCGCTGTGAGCCAACTTGTCAGAAAGATATTGA 

GTMGCAGTTAMGAGATTTAGGGAGTGCTGATTGCTAGAGGAGGCCACC 

CAGCTMGTTTGTGCTTACAAAGGCAGACAAAGT(XTGAGTTCAGGGT*GG 

GCCTGGMCAGAGCMGGTTAGTTAGACCTT€GTCTG<3TAGAAATGGTAA 

TTTCCAGACAGGATACCCMCTAGTTTTTGTGCTTAACAGAGGCAGGTAG 

ATCTCTGAATTCTTTTGTMTGTTAAAAGGAMTGTGTGCiTKaTTGTCTC 

CCMGGGGCCTGAGTCCCAGGATGCTGATTTATAGGAAACCTGGAGTAAC 

TGGGTTTATGACCTGCAGGAGAC<aAGCTA^^ 

GCMGAGAGMCT<aCCTGGAGAACT<src 

FIG.3D(8) 
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AAGCTGTCTAGA6A6AGAGCTGTCTGTA6AGAAAGCCGGTCAGA6A6AAA 

GTAGACTGGAAMCTGTCTCCAGCTTGGACCCACMTTTGACTTTTTGTT 

TTTGTTGACAAGTTGCCCTCCCCCAGAAACACCTTCCTCAGGACCCCTCC 

CAAGCCAAGGCAGGGCCTTGGCCCTTCTTGTCAGCTTGCAAGGAGCCAAA 

GATAGCATTAAATGCTTTGGATATCAAAATAAGCAAAATGCAAAACAGTA 

AACACTCTAAMTMTTCTGGCTAGTCCTTTAAATATTAGGCCAGT-GCAC 

TGTTATTTTACCTTMTGTATMTCTTGTGTTACATTTTATTGTTnTAT 

TGTATMTAGGMTGTCAGMTTATMTTTTGTMCATTTGTTTGACATT 

CCTGTGAAAATGCATCTAAAGATCATTAAAGTGCATCTGAAGATCATAAG 

GACTCACTGAGGAGCACAGGGAATTAAGTGTCTGCTTAAGAGAACTTTGA 

ATCTTTMTCTTTAGMTTTGTTTTAAAMTTTGAATCTTGCCAGTGTGG 

TGGCGCATCCCTTTGGTCCCAGCACTCAAGGGGCAGAGGCAGGTGTATCT 

CCATTAGTGTGAGGCCAGCCTGGTCTACAGAGCAAGTTCCAGGCCAGGCA 

GGGTACACAGAGAAACCCTAGCTTAACAAAACAAAACAAAATATGAATCT 

TTAAAMCTTGTTCTGTGAAAATTTCATACATGTATACAATATAGCTTGT 

TCATATCCACCGCCATTCCTTCCAGCTCCTCTAGGCTTTCCCAGTGCATC 

TCCTTCCTAGCCTTATGGCCTCCCTTTCAGGGTGAAGGTTAGCACACTGA 

GTCCAGTTAGTGCTGATCCGATGCAGTCTTGTCTAGATGGTCTTCTTTAT 

MTMGGTGAAAGTATATCCTAAACTTCCGTCTTTTGCTCTAAGGTG7TT 

AGACTTTAMCTMTGTTTAAATCGTTTAAATMTTTATTATTTCATAAG 

AAGAGGAGCCTGCAACAITGACrrTAACTATTGTCTCTTATCCAGAAAAG 

FIG.3DO) 
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MCACATGGAGTATATOCATACTCAGGGT^TCTTGTCCAAGGGGGTTA 

TGGCCACAGTAGTGTTTATGATGACAGGACCAAGGCTCTGTACGTTCATG 

GTGGCTACMGGCTTTCAGCGCCMCAAATACCGGCTTGCAGATGACCTC 

TACAGATACGATGTG<^TACTCAGATCTGGTGGGT<37TTTCCTAGAGCTT 

TCCCTTGGTAGTCTAGMTCTGCAGAG<3CMTTGATTAAAAATACTGTGC 

TATGGTTTGACTTTTCTTCAGCATTGTATGTMCAMGTTAGGAGATCAA 

TACAGTAATAGAGTTAAGGTACTAATCGTGCTGTTGCTGTCTG1TAGTGC 

TTAGTGCTTTAGACCTGATTCACTGAACTCTAGCAAGGTTTCCTCTCTTC 

AGMTTCTCAGCMTAAMGCTGTGCTGATTTTATCCATACTTAAAAAGC 

ATATCCTTCCTTTTCTCTTTTTGGTGTT<^GATCAAAGCTTGTACATGA 

ATAGGCTATACCATCTTTATCCATTTACATCACCAAACAGGATCCTCTGG 

TGCCTATTTGATAGGGTTTTCACTCACTTC^^CTGAMCTTGGGTTCTA 

AGAGTATGGTACTTTTAGCAMTGGAMTAAATTTCAGTTATGATGCAAT 

TATAAAGCACTGGTCTCTCTGTATTTCCCTCCTCCTTCTACTCCCTGCCT 

CTTCCTTTCTGACCCCCTCTCTCMCATACATTAGAGACCATGCTTTGAC 

TGTCMTTTATGCTGTCCTGMGATCAGGTCTnAGTGGCTGTGAACCAC 

GGAGCCTATGCAGTCGMGTTCTG<3TCTCTOGCTTTT^CTTACTAATAA 

MCACTGAGCATAMTm^mGTAmCACMTTCTOCCTCGAATT 

CTTMGTGGMTTATGGAGGCATACAGAATGMCATTTTAGG^TTTTAA 

TATAGTTTCCCGAMTTTTAACAGATTTTCAT^aATTCTTA 

CTTACGTATA^G^aAMTCAAGTATTGCACATTT^AATCTAMGTTATA 

FIG.3D(10) 
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MGTMTTACATTTAAATTGGCAAATAAGTATTCTTTTAAAACTAACCTT 

ATATTTATTATTTCTAAATAAACTCAAAAGGACCATTCTTAAGGACAGCC 

GATriTTCCGTTACTTGCATACAGCTGTGATAGTGAGTGGAACCATGCTG 

GTGTTTGGAGGGAACACACACAATGACACTTCCATGAGCCACGGTGCCAA 

ATGCTTCTCCTCAGACTTGATGGCTTATGACATTGGTAAGCTTTCCAAAG 

ATGTTTTAGCTTCAGGAATATTTTCTTTGCTGATGGAAAGATCACTATGT 

TAAAATMTTGCACCATTTAAMGAAGTCCAGGTGGTAGAATTTGCATTT 

MTTTGAGTAGGGTTACACATCTATTGAAMGCATTATTTTGGATTAAAC 

TACATTAMTTCTTTGTGAAATCACTCTTCTTMTTGCT7TMTTCTTTT 

TTTAGGTTGAGTTAATTGGTATCTTCTTTCTTATAAGTGCCTTACATAGT 

AGTGGTGGTAGTTGTMCCACCAGTGTTATGTTAAGTTTGATGGGATATG 

CTGTTTCCTAGAAACCTGGTTTTACACATGCTGTTGATGTCAATATACAT 

GTGGCCAGAAGAGGGCAGTGTCTGTTTATTCCTGGAAAATAAACATCAGC 

TGCTCTGTTGTGTAMTATCACCCATGTGATGTTCTTTCTGTTTATTTGT 

CmGCATnTGAGACAGCCTCACTATGTAGTCTAATTGGCTGAAGCTCA 

GTATATAGATCAAGGTGACCTTGAACTTAGAGAAATCCTCCTGCCTCTTC 

TGAGTGCTAAGATTAAAGATGTGTACTACGAATGAAAAAAAAAAATGTGT 

ACTACCACACCTGACTAGAGATTCATTTTAAAAATTATTCTTATTGTGAT 

AAMTGCTCAGAATMCACTCACCATCTTMTGTTTTMGTAGTTTAGAT 

TTAAATATATTCCTAGTGTTATTCATGTTATAATACCATCTGCTTGCCGA 

CTTCTTGTAAAACTGAAACTCTGCCCTTAAACAATAGTTCCTCTCTTCAT 

FIG.3D(11) 
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ccctcactccagcctcttgaaatcattttctatatctctat^ttttgac 

tagtctaaattaggca i i 1 1 1 i aaaaaamtattttgtttacttgtatgt 

gtatgagtgttttgcatgcatgtatgttaagcacaccatgtatattcagt 

gcccatagaagccaaaagtaggcatagattccccagagctggaa7tacag 

acttttgtgagccaccatgtgg^t^tcgatactctccgcamtcctttg 

gaggaatagtgagtcttcttagctgttgagccatcttgtcagccctagat 

gtttgtttttmcamcgtgtttttgccagcca 

gaatggggggtacactatagttagtccttagcttcaagcttgtggaagca 

gaaatgagaagacaatataatc7taactcaggaggattcttgctggctga 

aacamgatgtgamttacctcg(^gcactcctmgc£actgg<3gt6agc 

agggtggtctggagaggccttgaagagaagctgtctgagcttgttcctgg 

ggacactgggagtcamta^cctcctggtacagggggatttagtgcagac 

mgaggcaggamgtacatgtcamtatttaggacttttgaaccgctacc 

tttcttttgtcatggtmcacagaaggtagcaggtgactgttagactaga 

atgttcagatctgattcagagtgccagggatcgtr^ttggtcttgtgta 

mgtctcacmgtgatagmtcatatgtctgtcttagactttttttg7tg 

taggtatittagatttttcttgtttttcctttttgtaa^ 

cactatggtccaggcaggcttaagac™tggtmccatcctactct-gcc 

twatg^gccaccatgaccmtttmgaagctctcttgggtggcattgtg 

ataagtgatctggaag^^atatt<^cagttagcaggct^tac1=gcag 

AAGTCCTMTTAGGmCTATCAAGGtt^^ 

FIG.3D(12) 

SUBSTITUTE SHEET^RULE 26) 

BNSDOCIO: <WO O0OS373A3 IA> 



WO 00/05373 



PCT/US99/16484 



48/173 

GTACCTGGCTGTTGTGTGTCTT6ACAAAAATATAACTGCCCTTTCTTCCC 

AAGTGTCTACTATGGACCACCTTTGCCAAAACTAAAAGCAGATTCAGAGA 

AAAACATATCATGATTGCACATGGCTATAATCCCTGAACTTAGGAGGATG 

AGAAATATGGCAAGATTGAGACCAGTCTGAACTATCTAGTAAGACCGTGT 

CTTTAATAAAAATAGTAAAAATTATAAAATCAGGGAGTAGGATCTGGGAA 

GAAGAGAATGAAGTAAGTGTGGGGCATATCCAATTGGAGATGTCTTTAGG 

ACAGAGCTGATTGCTGAGAGGTGGTTGTAGGAGAGGTGAGTTATTGTGGG 

GCATAAAAGATGAGCAAGAGTCAGAGACAGTTGGAGAACAGAGTCTGAAC 

AAGAGTAGAGACTAAAGAGAGTGTCAGAGAAGCAGGGAGAAAATAGGTGA 

GATTGATGACCTGTGAGATATGTTAATGGCCAGAAGAGTGGCTAAAAATG 

ACTGGAGAATCCTTCAGACTTGTCAACAAAGAAATCCT7TAGCCTAATTT 

AGGGTGCAGGCGGCTGAGGAAGGACATAGGTGAAATATGTGCTCTGTGTG 

TTCATTTTTATTAAAGCTTATCTGCAAAGGCCTCAGATTTGCTGTGTACT 

TGTAGCTGAGGCTCTTTTGAACTCCTGGTTCTCCTGGCTCCACCTTCGCA 

AGTGCTAGGATTACAGATGTGTGCCCTAGTTAAAATAGCTGTATACCTAG 

CATTAAAMTTTTAAGTTAGAAAATACTGTGGTGCTCCGGGGATGCATCT 

CAGCAGTAGAGTGCTTGCCTGCTATACACAAGGCCCTGGGACTGATCCCT 

AGCACCACAAATACTAAAGCAGACATTCTGGTAGGGAAAACTGGTAGACA 

GCAGAGTGGTGACCATCAGGAGGGGGGTTGTGGGTGATGAATCACTACAT 

TMTTAGAAGTTCTGTGCAGTATATTTATTTCATGCCCTGAAACATreCT 

GCTGCTGTTGCTGCTTTCCTTTACACATAATAACATAACTAAAAGACAGA 

FIG.3D(13) 
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CMGCAT6TGGTATGAGGCTGTGGAT6AGGCATTCTTTGTTTTCCTTTTT 
I I 1 1 1 I I 1 1 I I GAGACAGGGTTTCTCTGACCTGGCTGTCTGAAACTCAGT 
AGGTAGAACAGGCTGGCCTTGAATGCACAGAGACCCTCCTGCTTCTGCCT 
TCTGAGTGCTGGGTTCAAAATTTATG 1 1 I t I I I CTATAAAGACTGAGAGT 
TCACATGGACTATATATGACMCCTACTCTGAAATGTGTTTTTCTCGCCC 
TTAGCTTGTGACCGATGGTCAGTGCTTCCCAGACCTGAGCTCCATCATGA 
TGTCMCAGATTTGGCCATTCAGCAGTCTTGTACAACAGGTAATTGGAAA 
GCAMGGCTCTATTACTGTCTTACATCTTATAiTCATTTTTAATATCAAC 
TTCCTMCAGTTGTATCTGMTCGTMGAGGTTTGGGGAGAAAAAAGGAG 
AGMGGCAGTTCTAAGTGCACGATAAGGTAAGGGGAATAGGACTGGGAGG 
TTATGGGGTCAAAGAGCAAGTCTGAAGTCTGCACTATATCCAGGTGTGTG 
CTCAGGMTACTTTTCTGACCAGCAGAGCTCTTTTTCCATTTGCTCCAGG 
MCCTTAGTCCTGTAMGGACATGCAAAGGACTAGGGTTGTGGGCCAGCA 
ATAGAGTGTTTATCTAGCTT^ACAAGATCCTGAGTTCTGACCTCAGCAT 
TTTGCCTTCTGCAMCACAGCATTTGGCATAAGGGACATGCAGAATGGCC 
ATTTTACCTAGTCACTTGAMGTGTGCTTTMGATTGAGAAACTTAACAG 
CCTGCTGATGCTGACTmc™Tm^TTCTGTOCT^mCT<3CTT 
CT7TCTTTMTACTCTMTCCTTACATTATATAGTCCTACAGGTATTCAA 
ATTTTCTGTTG^GTTTCCTMTACAAGT^ 
AG6ATAAAAGTG€CATTCTG^^TreTGAAGAATGACCGTTTAGAAGCTA 
WAGTCaGGGAAAGATWATCmAATCAT^^^ 
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CAAGTATATAGGGGATTGTGGCAAGACCATTGTATGATTAGA6ACTAAAG 

TGGAMGATTTTTTAMTATCTTGTTMCTTGAGTGTTATCTTAAATTAC 

AATCTGATGCTTTCCTTCAGAAAAAGCCCTAAATGCCTCTTGAGGTTTTC 

ATCTGGCAAGTATCATGTCACCTGGCCTTGCTGGTGGAATCTGCCCCAGC 

TCATGTGTGTTCTTAGTGTTCTCCTAGCACAGAGTTAGGCAC^TGTGGGC 

ATTTGCATACTAATGTATAGTAATAGTAACAATTGAATGAATTGTCTATT 

AAMC ATTCTTMGTTTTAC C C AAAC ACAGAGAGGTCGAC AATTTGTC AT 

AAAATGTAGTTTATCCATGAATCAAAATCAGGAATGACTGTCTGAACAGT 

Gl I I I IAI 1 1 i 1 1 ATTTTATTTTATTTTGTGTAATTTCTGTGATGTGTTT 

GAATATCTCAGTTTTAGGCAGGATTGGAAATGTTAGAGGTTGGTAAGAGG 

TCATGGTTGCAGTTTGATCATGAGAGAMTCGATGGCTCTCCCTTCATTG 

CAGTGTTGTCAGTCAGCAGTGTGGGATCACCTATGTCTAACAGTTGTTCT 

AATTGAGAGAGGATTACAGGAGGGAAAGCAGTGAGATTGTGAGGTGCTAG 

ATGAGGAGATGGCATTTACCTAGCAGCCTTCTCTCCCGCCCTCCCATCAT 

GTGACCTGAGAGATTCACMTTTCTGAAGATATCAGCTGTGCTTAGTTTA 

AGCMTAGTTTTATTAACTAAATCCAACTTGATTCATGTTATTCCCAGGG 

mccagtggtaggattaaaaatgmtcctagtgttctttttggttattgg 
mtgtcaagttttcagacactgtaacgaatacagagccatacaatcacta 
tatttattt<^tcctttgttgacttagaaamtTgaagcccagtttaggt 
gagctaccamtttctcattgtggattagtattaaacttgcgtggagttg 

T-GGGATCTTGGAAGTGGGGGCTAAGCATCCGTGTTTGTCACAGCCCAGAA 
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GGAACAGATGAGGTOCTmGAGGAGTCTOTGTCTTTATGMCTFGGA 

CTTAGAAATATTTGATGTGTTTAATTCTGCTGTAG I 11 I 1 1 AAACTCTAG 

CTAGTGAGCATCTTTTCACAGGAGCGCTTGAGTCTGACCTACA€CCATTG 

TCTGTCTCTGGTGTGCATATTACAMTGCACTGG^AGCtaTTTCTTGACGC 

AMCATATMTTAGATTTTTCTTCTAAAAAGGTCTAGTTTGGGAAGGAAT 

GAMGGGATTAGAGAMTGTTGTGGGTTTCGTATTTATTTATTTATTTAT 

TTATTTATTTAATGTATATCAATGATCTATCTTCATGTATACCTGCATCC 

CAAAAGAGGACATCAGACTCATGATGGTGATGAACCATCATGTGGTTGCT 

GGGAATTGAACTCAAGACCTCTGGAAAAACAGCTGGTGATCTTAACTGCT 

GIAGGCATCTCTCCAGCCCMTTGTTCTCTT^ 

TMTTTAGAGATGCCCTGCTTTTCCAAMGTGAGTTTTAAACACTAATTT 

CCATTGTCAGTGGATrGGTCTTTTAAGAATATAGGTAGTGGTGGCACAGG 

CCTTTMT<X)CAGCACTTGGGAGGCAGAGGCA€GTGGATTTCTGAGTTCG 

AGACCAGCCTGGmACAGAGTGAGTTCCAGGACAGCCAGGGATACACAG 

AGAAACCCTGTCTCGAAAAGCAAACAAACAAAAACAAAACAAACAAACAA 

AAACAMCAAAMGMTATA<3GTTGGMTAGGTTCGAA<3CAGCCAATGAT 

AGTGCATACCTTTMTCCCAGCACTTGAGAAGCAGAG^AGGTGGAACTC 

TGAGTTTGAGGCCAGCCTAGTTAGTCTACAGAGTATTTfCCTCGAGAGCC 

MGGCTATATATAGAAACXCTATCTTGAAAGGCCAAAAAAGGAGGAAAAA 

AAAAAAAMGAAAAGAAAAAAGAAAAAAAGMTGCAGGTTGGGCAGTCAG 

GGTMGTGTCTAAGGTMGAGGMmTW 

FIG.3D(16) 
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CTGCGCCAGCCTAGGCTACAGAGTACTGAAAGGG6AAGAGACTGTCCATG 

TGTCAGACCCTCATTTCTCCAAMGTCACATGACTATATTTTTTCTGTAT 

TGCCCACTCTTCCATACATGCACCTAACAATAAATATTGAAGTTCACTCT 

GTGGCACTATATCTATGTGATAGACTTCTAGAAAAGTGATTTAAAGTTCA 

AAAGGTAMTACGTAGTTTTGTTTCMGTTGCCAAAATCCCTTTAGTAGA 

CTCCTACAATCTTACATGCCCAGTAGCAGTATAGAAGCTTGCTTGTTGCC 

TTGMGCCTCACCMTTCAAATATTAGGTMCATTTG7TACATTTTTCTT 

TGTCAGCTGGATAGGTAATGAATGACACAACAATGTGTTCCCATTTTCTC 

TGCATTACTAATTGAAGTCCTATCACCCACAGCAGACTGAAGAGTTCCTT 

TMTATTTTATGGACTTTGACAAACCTAGGATTCATAGCTTCCATACAGA 

GAGGAATTTCACAAATAGCAAAGTTGGGCTGTTAGAAGAATAAAAAGAGA 

A7TCTGAGTACAGCTTCTCAAAGAAGAGTCCCACGTAGGTGTCCTCTGGG 

ATGTGCCTAGATGCAGGGTTATTGTACAGGAGCTCTTCTGTCTGCTCTCT 

GATACTTGAGATTATAGGGTTGCAGGGAAATGCATTAGATGGCATTACAA 

ACTGATMGATAMGTTAGGAGCTATCAGAGATTTAGGACATGGTTTTTC 

TCTGTAAATGGGGCTTCTGGTGAGATTCCTAGAAAATGCTGTTTATAGCT 

AGGAATGGGGTTATAGCTAGGAATGGGGAAAGACCTTAAGCAGTTGTGAG 

CTGTGGTGGMTGCATGTGTTTTCAGTTTGCTAAGGCTTCCGGGAATACT 

TTTCCTGTCGATMTTTTCTTTCACTCTCTTTGTAGCCTTCTTTGTATTA 

AMTCCTCTCTGCTTGC7TTTGTGTGTGAATGTGTGTATGTCTGTGTTTG 

TGTATGTGTGTGTATGCATGTGCATGTAGGTCCCTACATAGGACAGAACA 

FIG.3D(17) 
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TAmCCTCGAGmTAGGTCCTTCT^GCAGCCTTTTAGGGAACCAAAC 

TCTGTCCTCTGGAAGAGTAGCCCCTTTAACT6CTGAGTCATTTCAGCCTC 

MGMTCTTCTCTTTTCCCTATTAGTA6AAGATGTCATCTTAGCTCTAGG 

AACTACACCACCTCTGGCCTCAGTGGACACCCATTTACATATGGACATAC 

AGCAGACAGACATATMCTAMGATAAAATAMTCTTTTTAAAATGTCAT 

TTCCCTGTGTACTMTTTTCCATGTACACACTCACAGGTAGATTTTTAAA 

CTATTCTGAGTGATCACAAAGCAGAGCAGAAGGTGAAATTTGAGAGAATA 

GATGATATTAGTGGATTTTGAGACCTTGAAAATAATGTCTCAGAGCATTA 

AATTAATCACTCATGTATGTATGTATGTATATAAGTATGTATCCATGTAT 

TATGTGGATGGGGGTGCTGTAGCACATGTCTGGAAGTCAGAGGACAACTT 

TGTGAAGTCATGTTTCTCCTTCCATCTTTATATGGTTGCAGTGATT<aAGC 

TCAGATTGTCTACCTGTGTAGCAAGTGCCTTACCTGCTGACCTGTeGCAC 

TAGCCCTCTCAGAGGACTTTTMTATTT^^ 

GTCAAMGTTTATTGTGAGCCAGGCACTTAAAATCCTAGCACTTGTGAGA 

CACMGATGGAGGTCAGTCCAGTCTACTGAGTTCTAGAGCAGCAAGGGCT 

ACACAGTGAMCCTGTCTCAAAMTTTCAAAAGGGGAGCTAGAGAAATTA 

CCCMGGIAGCTAAAGGGAACTGCAACCCTATAGGTGGAACMCAATATGA 

ACTMCCAGTACCTGGGAGCTCTOTCTTTAGCTGCATATOTATCAAAAG 

ATGGCCTAGTGGGCCATCACT€CAMGAGAGGCGCATf<^CTT^AAAC 

mATAT<£CCCAGTACAGGGGAA€<£H: 

T^TAG6G^TT<3GGGG<&T^^ 
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AAAATGTAAACGAG6AAAATACCTAATAATAAAAAAAAGAAATGATATCA 

GAAAAAAATAAAAAAATAAAAAATAAAATAAAATAAAATTTCAAAAGCAA 

CAACTGAAACCAGCCCTACGTCGTGCCTCTGAGTTCTCAGTAAATTCCTT 

CTCTCTCTCCTCTCAGCACCATGTATGTGTTCGGCGGCTTCAACAGCCTC 

CTCCTCAGTGACGTCTTGGTCTTTACCTCGGAGCAGTGCGATGCACACCG 

CAGTGAAGCTGCTTGTGTGGCAGCAGGACCTGGTATCCGGTGTCTGTGGG 

ACACACAGTCGTCTCGATGTACCTCCTGGGAGTTGGCAACTGAAGAACAA 

GCAGAAAAGTTAAMTCAGAGTGTTTTTCTAAMGMGTATGTTTTTTCT 

CTACTTAGMTTTAAAAATCTAATTTTATCTGAATTGTGAAGGAACCTAG 

TCTCTGTACTTTCCTGTTCACCTTACTCTCTAGTTATTTCTTAATAAAAA 

AATACACAAGATCTTTGGATGGGAGGAAGCATGTGGCTCCTGGAAGCTGT 

TAGCAGGTAATAAGTTGTCTTTGAATTACACAGGCTTTGTGTACCAACTC 

CTGGTCTGGCTGCAGGTGATCTGMGCCAtAGCACAATGAAATTTGTTTT 

CATTTTGGTTTTATGAGACAGGGTCTTGCTCTATAGCTCATACTGGTCAA 

GCTCC7TGTCAGCCTCCTCCTTCAGCCTCTTGAATGCTGGGGTTATAGGC 

ATGCATCACTGGCCCTACTTGGGAAATATTTTGATGACAGACATGCTATA 

TAmcmGmAGTTTAGTAGCCACTAGCAATCTGTTATTATTAGATA 

TTTGAAATGTG^TATGTMCTMGGGGCTMCTGTTTTCTTTTCTTTAG 

TGTATGTAGTGAGGCAGATGTAGTAGCACACGCCTCCAATCCAGACACTC 

ACGAGGCTGAGGCAAGAGGCAGTTCTAGGCCAGCCTGGGCTGTGTAATGA 

GACCTTGTCTCAAGAGCCAAAACATCAACAATAAAAGAACAGTATGTCGC 

FIG.3D(19) 

SUBSTITUTE SHEET (RULE 26) 
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TATTGGCTGTTATGTTGATGATGAAGGTCTAGTGTTAAGGATAAGAGCCT 

CTAATGGTATGATCACATATAGCAAATTGCTCTGGTA6ACAGCAGAGAGC 

TGCTGTTCTTGAAMGTATTTCCAGCCCCCTTTAGCTGTATATAGCAAGC 

AGTACAGCATAACAGACAAACTAT<GGTCCCTTCTTCTAGAGCCCCTGGCG 

TGCTCnGTOTTmCTCTCCTTOCTACTT<3CTTAGTGGTT<^ 

GCACCACTTCACCMCTCAGCGMGTAACGTGCAAAAATGTTTGGAAAAT 

MGMTGCCTCCMGATATTTGTCCATATCMTCTTTAAAGTATGAAACT 

ACTTCCTTATCTAGTTGTTGCAGTTACATGAGAGTTATATTAGGCAGAGA 

CTACTTCTGTTTTTCTGGTATGTGTTAAATAAAGTTGTCCAGGGACATAA 

AGCTCCTGAGGCTCTGCTGTTGATTAGMTTTT^TTCATTTATCGAAAA 

CAGCTTACCAGMCCTGGTAGGATTCATMTTCTCCCGAAACAGTTAGAA 

TTGGTAGMTMCCAAMTTTAAAGTTAAGCTTAAATATACAGTGCATTG 

GAAATAATATTATCTTCTGAGGTTCAGTATGAGCCCATTAGTTTACCTCA 

CTTTCTGGGTAGACCTMTCCTGTCAGAGTAMCTT'GGCAAGAAAAGCAG 

CCTACATGAAMCTGATCAGGCAG©aAA€TTTCTCT<BGCCTCTCTTCCTG 

CTTGTGTATGTCATATTCATGAMTGATTTATAGATGGCAACATCGCTTT 

TAGCTTCTTGTTTGGGGATTTAATGAGAATTATGTTAGGTCTACAAAGAG 

TGGMGTTGTCAMTCCACAGGTTTCGA<3TCACATGAGTATATA<3AGTTC 

GAGTOGCMGTCCCTCCTGT^GTT15T^T<:ACTCGGTATACCT<3CA 

CCCA<^TAGG<XTT«CATTOTAACAAGGACAMTCTATTGGT€^ 

ATT<aCTTTCTTA<3GCTTCT<3CA 

FIG.3D(20) 

SUBSTITUTE SHEET (RULE -26) 

BNSDOCHD: <WO 000S373A3 . 1 A> 
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TT6ATGTTTGTCGTGGGAAGAAAAGCATCCATTACTTCTTAGAAGCTATA 

AMTTMCAGACCT7TGCTTTTCAC"rTTCTGGACACTATGGGAGGACAGT 

TATAAMCAGTGTTTCTCGGATTGTCTGCTTATATCTGTTTTATTTTAAC 

CTAMCATGGCACTGCTTTTTTCCTTTCAGTTrGACTATACACTTTGCTT 

CCTGACTATTGTTAGGAGCTTTCCTACCTCAGATTATACATAAGAGAGGC 

TGCCGCATAGTTGATGGGTTTGTCTTCTCTCTGTAGCCCTTGACCATGAC 

AGATGTGACCAGCACACAGATTGTTACAGCTGCACAGCCAATACCAATGA 

GTGCCACTGGTGCAATGATCACTGTGTCCCTGTGAACCACAGCTGCACAG 

AAGGCCAGGTCAGATGCTGTTTTTCACGGATTTTAGGGAATAGAAAAATG 

CTAGATGAGTGTGAGTGTAGGGCAAATMTGAGTAGAGTTGTTTTTAAAA 

TGGGATATCGATTTGMTTCTACTGTTGCTCAGGTTTTCTCTTAGGAAGG 

GATGCTATATACATCCTGATTCCAAGGATCGCTCCTGCTGCTGAGGTCTT 

TGTGCAGTGTTTCCGAAAGCATGTTTTACAGAATGCCCTTGGCCCATATC 

TGACTCAGCATGACATCTGGGGTAATCATGTATGATTTGTTATAGGTGAT 

MTAGGCTATGAGTMGGTGATCCAGICTTTTGCTGTCTTTGATGGCTTAT 

GACAl Mill I CTCAAAGTTTAATGCATTTCATAAGAAATAAGACTTGAG 

ATTGCTATGGTGGGCACGGGCTGGGAGGAGCTCTGGAAAAGCAGCAGGTT 

CAGCTTTCACGTTTTACAGATAAGCATTGGCTGAGGCTTGGTGGTGCCAG 

TGGTTCCGTTGGGCTGCTAGCTTGCCAGCTAAAAGCATGTTAGTGAGAAT 

ACACACTGTGGTATTCACATTGCAGTGCTCCTTCCTGTTCATTCTAATTC 

TATCATTCATCCATCTACCTATCTCTATCTATCTATCTATCTATCTATCT 

FIG.3D(21) 

SUBSTITUTE SHEET (RULE 26) 
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ATCTACCTACCTACCTACCTACCTACCTACCTACCTACCTACCTACCACT 

TATCTAATTCTATCTGTCTGTCTGTCTGTCTTTCTGTCTATCTATCCTCC 

ATCTMTTCTATCTATCTGTCCACCTATCTATCATCTAATnTATACATC 

CATCCATCTATCCATCTATCTGTCTGTCTATCATATATGTAATTCTAACC 

ATTCATCTATCTATCCACTTATCTGTCTGTCATCTAATTCCATCCATTTA 

TGTATCTATCTATATATCTAATTCTATCTATTCAATTCTTTTCI 111 1 1 I 

CTATCTTTCTTTCTGCAGTTACCATTCTCAGTTAATTCTCACTGAGTTAT 

TTGTGTGAATAACAAAACACTTCTCCCCTGTGTTCCAGATCTCCATTGCC 

AAGTATGAGAGTTGCCCCAAGGATAACCCCATGTACTACTGCAATAAGAA 

AACCAGCTGCAGGAGCTGTGCCCTAGACCAGAACTGCCAGTGGGAGCCCC 

GGAATCAAGAGTGCATCGCCCTGCGGG<3TAGGCCTTGCACAG<3GATGTCC 

tCTATAAGGTCCAAGCTTGGTGCTCGCTCCTCAGATCAAGGTGGACCTAG 

GMCMGATTGCTTATTCTGTCTATTTAGCCCTCTCACTATT^3GG6GGGG 

GGGGGGGCGATATmGTATGTTTrTMCTT^ 

GTATTTACTAGCCTTTGAAAGAAAGTGMGTCTCAGCTCATCTTCTGGAG 

AATTCGGGGGTAGCTTAGATCCATGTTACAAACTGTGTCCCACTGTCCTT 

CCTTCTGCTGTGAAGGAGAACCTGtaCACTAGAGCTCTGT^GTCTCAGCAG 

CAGTCAGGAACCTCCAG6AA<3CACTOCT^CAGTTCTGT€AGMGAGAT 

TTCTGTACCAGCATCATCT<K^CATCTGAGCTfGCTTC€C<^CTATTTCAG 

CAGAGGTTCTTCAGGGTATTAACTTAGGTCCTGAGGCCAGCTAGGCCT-GA 

CTAAATCTCTAT1^TCTAm<£^ 

FIG.3D(22) 

SUBSTITU'PE^SHEET^RULE 26) 

BNSCXXID: <WO _0006373A3JA> 
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TTCTGTGCTCTCCAACATCGAGGTTT6AGGGGAAGTTGGTCT6ACTCTTT 
TGAMGCATTTTATTTAGTTTGCTGMTGGGCTTTAGTTTAGCCAGTGTT 
CTATTGCTGTGMGAGATACCA7TTCCAACGTGTAACT7TTATGAAAGGA 
AACATTTAAGTGGGGGCTTGCAGTCTCAGAAGCTATTATCATCATGACAG 

ggagcatagaggcacaaaggcaggca™gagtggtagctgagagctaca 

TCCTCATCTGTGAGCAGAGGCAGACAAGGTGTGAAAAAGACAGAACCTGG 

CCTGGGCTTTTGAGACCTCAAAGTCTACCACCCCCAGTGAGACACTTCGT 

CCAACAGCTCCTGCAACAAAGCTCCATCCCCCGATCCTTCTCCAGTCCTG 

CCACTCCCTGGTGAATGAGCACTCACATATATGAGCCTATGGGGGTCATT 

CTTACTCMGCCACTACAGGCTTTGTTTTGTGTCTCAGACTTTATGTCAA 

TAGAATACCTAGACACCTTGTTACAAGACAGGCCTGGAAAGCCTGCAGTG 

CTGACTCCCTGCCAGTAGCACATTCTGAGGAGCAAGTCCCTTAAGTCGCT 

TACCTGCTCTTACATTACGCCTTTCCCTGACCATTTAGTGAGCACTGTTG 

GTGTCCCCAACCTGAACCTGGTTCTGGGGAAACACTTGCTTATTCACTTC 

CGTGCTAATGGCCAGGGAGCAAGCATGCTTTCATGCAACACTGTGAGTTC 

AGTACAACCACAGGAGGAGATTGCAGACTTCCTTCGTGTACTGTATCACT 

ATGAGGTTTTCCAAACCAGTCTCCCTTTCACCTCATTnTTGGCATGCCT 

TATGTACTTGCTTATACTTTCTATCTTATGACATGAAAACAGAGTGGCAT 

TTGGAGGCTTAMTTTATCACATTCCCMTTCMTTCCATTTTCAGTTTA 

CTCTTTCTGTATATACATCAGTGTGCAGATAAATATCTCTTTGTGTGAGC 

ATTGGAGGCCAGAGGTTAACCTCTGGTATA7TCTTCCTCTATCACTCTTC 

FIG.3D(23) 

SUBSTITUTE SHEET (RULE 26) 
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ACAG6GTCCTTTGATGAAT6TGGAGCTCACTGATTACATAGACTAGCTGA 

CTCAACCCTCAGGCCTCATAACCCTGCCTCTAGCCCTCAGATGAGATTAC 

MGCMGCAAAACTACGCCTCGCCTTTTATGTCGGTOCTTCGMTWt^ 

CTGGGTACTTATGCTTGACACAAGTATTTTATCCACT6AACCATCTCCCA 

AGCCTCCATTTGCAG1 I 1 1 1 1 ACCTCACCCTTCCAATATATATATTTATT 

TGTATGCCCTTTGTTCMGATTTTAGTCACCTTTTACATTTTTCTTCAAA 

MTMTTGCACCAATTTCTTAATAATGGCACCCAAAAGTAGGAACATTAG 

CCTAGAGTATACCCTGTGAGCCAGGAAATGTGACTGGTGAGACTreTAAA 

AGGGTCTTTTTATTCTGGCCCTCAGCGGA<3GCTCAGCAGTGGAGCAT'GCA 

TGCTGTTCCTCTGGAGGACCGGA^TCCCCAGGGGCCAQGTCACAACCAC 

TTGTMCmMCTCTGATCTMTCCCCTCTAT^CTTTTGTGCTATAGT 

CTCTTGCACTAACCCACACTCAAGGCACACATACACACATTCTTTAAAAG 

ATAMTTATTTTATTTTCAAAGG 1111111 CK3CATATAGAAGTTAATAA 

TTTGTCTGTOTGCTCACCAGATCCTAACAAAGCACCTGAMTTCAAATC 

AGGATGAGTTCAGATGTTCAGTATTTTGAACTAGTAAACCGAACTGCATA 

ATTCCTAAMCTTTGTTTTCTTTCCTCTfCCGCTTTAAAAAAGAAAATAT 

CTGTGGCAATGGCTCGCATTTGGTTGGAAACTCGTGTCTKaAAAATCACTA 

CTGCTMGGAGMTTATGACMTCCTAAATfGTCCTGTAGGAACCACAAT 

GCCTTTTT^CTTCCCTCACATCCCAGAAGAAG^TGGAGTTTOTCCTTAA 

GCAGCTTCGATTMTGCMTCATCT^^^ 

AGAACTTRSGmCTAGGGCACAACA^ 

FIG.3D(24) 

SUBSTITUTE *SHEET««(RULE 26) 

BNSDOCIO: <WO 0005373A3 JA> 



60/173 

GTTGTATGTTAT7TGCCTGGTAAGA6AAAGAATTACAATAAAT6ATTAAT 
MTATACTGTGGGCCTCTATTTCAGAGGCTCTTCTTTTGATACCTTTCTT 
CTTGTCTTAAAMGTTCAGTACTTTGCATATTTTATTAGTTCTTATTATT 
AAGTAAATTATAAGGTATGAACATATGGAATGAATGGTAATATGTGTACA 
TATTCTGGTGACATCAGATTATTTTGTACTTGATTTATATCTAGATTCTG 
CTTGGGAAAAGGGAGAGTAAAATGTTAGTTACCTAGGTGTCATTAAAGCC 
ATCTACAGCCCCTGGAGGTATTATTATAGCACATAGTGTAATCGTCAGTA 
AGAMTGTAAMTCTGCCCAGGTTTTATAGCCTTCTTCCTAAGGCTTCTG 
AACTCAGAAAGTTCTCTTACTCTAGAGCCAAACTCTCAAATGGCTTGTAG 
TTACTATATAGTCTCATTTGGTATTTTTCTTGGTAAGTCTAATTCTAAGA 
- CTTGTGATTTGACTGTGATGCTTCAGTCAATTAGATATTCACAGAGCAGC 
TmCTGTCTATGCTGGCTGTGGTACAGAGAGATGTGAGGGACATGTTTT 
TGTCTAGCCAGGAGAAGACAGAATGCAGCTCAGCATCTCTCATTTGGCAC 
CACCTTCATGTGATGGGATGCCGGTATGGTGTGGGTCCTGGTTGTTAAAT 
CTCAGGAAGTCCATATATCCAGAAATGACCTCAACTATAGGTGGATTTCT 
GGCAATTAGGTAAAAGTCAGCATTCCTTGGGCACTTGGGAAACTGGTTAC 
CATCTGCATAMGGAGTCATTTCCCTTCTATCTGGCAGAAGGGACATATG 
GCTATCTATTGTGCCTGTCAGCATGGAAGCACATGCTAGTCTCCAGGTCC 
CCCCMTATCACAAGTACCTATAGCAGTGMTTAGTTAMCTGATTTGGC 
TCCCAATGGGTCAAGTACAGCTGCACCTGCCCAAGAGCTCTTTGGGTTTG 
CAAATGAGAGACACATAGTTMTT7TTATATGCTTTGACTAGTTCAGTTG 

FIG.3D(25) 

SUBSTITUTE SHEET (RULE 26) 
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CTGGACATTTCTAATCCTCCCTGCAGTAGCATACATTAACCCCTCCAACT 

TTCCTGAGTCAACTTACTAACTCAACATTTCATCTCT6ACACCCCAGACC 

TAATGGCAGAGTGGCCCTTAGAGCCACTTT-CCCAA I I I 1 1 I I I I I ATCAG 

ATATTTTCTTTATTTTCATTTCCMTGTCCCCTTTCCTAGTTTCCCTGTC 

CTCTCCCCCTGCTCCCCAACCCACCCACTCCCTCTTCCTGGCCTTGGCAT 

TCCCCTATACTGGGGCATAGAGCCTTCACAGGACCAAGGACCTCTCCTCC 

CATTGATGACCGACTAGGCCATCCTCTGCTGAATATACAGCTAGCACCAC 

GAGTCCCACCATGTCTTTTCTTTGATTGGTGGTTTAGTCTCAGGGAGCTC 

TGGGGTACTGGTTAGTrCATATTGGTGTTCACTTTCCCAAATTCTTACAT 

GGCT^TTTAGTTCTTTCCTGCAGCTCTTAGGTCTMTCCCTTTCCTTCC 

TCTGTCATGGTGATTGCCTTCCTCTCCTATCTCAGTTCCTTGGCTGCTCA 

ATCTAAMGTCCCACCTCCATCTTTCTGCCCAGCCACTGGCTGTATGCAG 

TTCTTTATTATCAGTTGAAGCCAGCTAGGGGCAGAGAGCTTCAGGTCTGT 

AAGTGCTTTGGGGAGCAGMTTMGACAAAGCATTAGAACCAATTCCCAA 

CMGTACCT<3CTATACATTTCAAAGTCCATATTAGTCTCCTG<3GTCTTCC 

CTTGCCCAGCTACTTGTCCTCCTTGTMTCCAMT^CAAGCTTTTTCAC 

ACATCTCTTTATCTCACATTTCCCTAGC^CT^CCATGTCCACTTCTTCT 

TTTACTCTCTGCTCTCCTCTCTTTCCMT^^TCTKaGATATTTTCTCTCT 

CTTATTCACMTAAAAACCAMCCAAAG-CAAACAAAAAACCTTACGCTAA 

TMTCGAGTlGGTCAC<^T^^ 

TCTTC^TCTGACACACTHSGCAG^^ 

FIG.3D(26) 

SUBSTITUTE SHEET (1RULE2S) 

BVSOOCID: <WO 0005373A3_IA> 
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CTGAGAGAAGCAGCAG6CACCTCT6AGGTGGTAGTTACTAGAGTGATTAG 

AACAGACAGTGGAGACGTGGCTGGAAATATGGACTCTGGTGTTTGGAGCC 

AAGTATGGTAGGCGGCAGAAGCCAGCAGAAGCATGATCCACACCTTCACC 

AGGTTGCTTCCATTGGGAAAGGCTGGACCCC7TGGGAAGGGGTCCCTTTG 

TGCCTTCCTAGGTGTTCGGAGCCAGGTGTGTGAGGGATACAGTAAAGGGA 

CTGACTGCATGACTGCTCCATTAGGGTGAAGGGTTTTGTTGTGAATAGGA 

GAAACAAAATGTGCAGAGGCATCTGGGAGAGAGCAGAGCAGAGTGAAAAG 

GAAGCAGTGTAGGCATGGTCAGGGCTAGGGACAGCGGAGACAGCAAGATA 

GCGAGTGGGTGATAAGGTGAGAGAGAGTGTGTGTGTGCGGTGCACACATC 

ACGTGCATTATAAGGAGGCTGAGTAGCTAGCTGGGGGGAGGGAAGGGCCA 

GAAAACTAGCATGCACTCTGAAACGGGTACTTGTGATGCTGAGGGAGCTT 

GGGGGAGAAGGGCATGCCTCAAGACCAGAAGAGGGAGTTGGAGTTACAGT 

TTGTAAGATGCCTAATTTGAATGCTGAGATCCAAACTCTGATCCTTTGGC 

TGAACATCATATCTGCTGAGCCATCTCTCCAGCGCCTAGAAAGGTGGTGA 

TGGTGGTTGTTCTTGTTTTGTTTTATTTTGTTTAAATGGGGAGCCAGGTA 

CAGTACATCATGCCTTTAATCCCAGCAGGAGATTCAGGAGATAGAGACAG 

GTAGATGTCTTTGAGTTCAAGGGCACCTTGGTGTGTATAGGAAATTCCAT 

CCACCCAGGGCTACAGMGGGTACCTTGTCTTTAAAAAAAAAAAAAAGAA 

AGAAAGAAAGAAAAAGAAAAAAAAAAGAGAATGAAATTTCAGAGTTATGC 

AAGATAGGAGCTCAGTGGTAGAGTGTGTGCCCAGGAAGTGCTGGGTTTCA 

CTCCTCAGAACAACAGCAGGGGCAGAAACTAGTCTACAGGTTCATGAGTC 

FIG.3D(27) 

SUBSTITUTE SHEET (RULE 26) 
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GTGTTTTGTTTTGTTTTACATAAAATGTGTTGAATTAGATAAGTAGATAA 
AATGTGACTCATACACAGATAAATAGATAAAATGTGATACATGTACCTGT 
ACATAGMG1ATTATGATCTCACCTTTAAAMGGAGGAAATAGAGAGTTTT 
GGTAGTTACACCACAGGAAAACTGGAAAAGAAAATGTATATATGAGGCTG 
TGCCCCATGGCTAAAG6MCATGTT7TTMGTCATTTGAATTCAGCAAAC 
AGTTTTAGGTMTGATATATCGTTTTCCATACMCCAGTATTTTATAAAT 
ATTAGCMGGTCACATCATTTATGMCCMCATTTAAACTAAATTTGTAA 
ATCATCATTTCTTTATAGCACTTGTCATAGAACATAAGTAGTTTAAAATG 
TGATTATTGCTTTGCTCTTGATGTCTGAAAATCTTCATGTATTCTCTTCT 
TTGAGCCATTTTTATGCTTTGCAGTACTGGATGCATATTCAAGTCATCAC 
TTATTTTAATCTAC CTTGC CTGAGTTTGGGGAATAGATGGTTTCC AC ATG 
TCTGTGGGTTATGCCTAAGCTAGTGG I 1 1 I I ATGTTAGAGCTTGTTTPGG 
GGMGGCACTGGTTGCATTCATAGCTGTGTTTCTTTTGCGTGTAGTCCAA 
GCTCACTCTGACTCCATG^GTT^GTCTTGGGAAGATCAATCTGTCTTACT 
GGTGCTGGGAGGATATGTCTCCATTCACAAATAGTTTGCTCCAGTGGAT'G 
CCATCTGAGCCCAGTGATGCTGGCTTCTGTGGGATCTTGTCAGAGCCTAG 
TACTCGGGGATTAAAGGCTGCAACCTGCATCAACCCTCTCAATGGCAGCG 
TCTGTGAMGGCCTGGTMG6ACATGGGTGCATATAGTGCTGCAGGAGGA 
GCCAAGACAGCAAAGGAGGCACAGCT<W\T-GAGCGCTGAGGTGAT-GAAGT 
ACTTATGGCAGCAG^aGAGAG^aA^ACCAATTTAGi^ATATCTAiTTCAM 
CAGAACXX^mCAGATAGTCTTOi^ 

FIG.3D(28) 

SUBSTITUTE SHEET (RULE 26) 

BNSOOCID <WO O005373A3. IA> 
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TACTGAAAACCAAAAATAAAATTGGTGAAAGAACCCAGTTTATATTGAGC 

TGCACTGTTTCGTT6GTCTCAAAGTGTT6A6AATTGTTCTAGAA6ATTAT 

TTCCTTGGTGTTGGCAGAGAAGTGCTATGGAGGAAACAACAACCTGAAAC 

CAAAGAAACATTTAGAAAAGCAGCAAGTCAGGACACTATTCAGACACTGC 

TGGGGTGGGGGGAGAGGGGCATGGCCAAAGAAGCCGACAGAGCCAACACC 

AGGGTGTGGCAATGTCCTGCGCTGAGGTTAAGGTTAGACTCCATGAGGCC 

AGGCCCAGAACAGCCATACACAAATGAGGACTCCAAAACAAGAGGTGCAA 

GTGTAGTGGAGACTCCATCCCTGCAGGTCCTGTTTCAGGAAATGATTGTA 

CTTTGCCTGAGTMTACAGCCTAGGAGCTACTTTCTGATAGGGTTTTTTA 

MTACTTACAAAGAATTATTTATCTTTAATCATGTGGTTTTGTATGTGTG 

TGCTTGCACATGCAGTGCTTGTGAGAGAGAGTATGTGTGAGAGCATGCAT 

GTATGAGAGTGTGAGAATATATGTGAGAGAGTGTGAGTGCATGTGTGCGT 

GTGTGCATCTGTGTGTACAGGTGTGTGTACATGCATGTGTGTATAAGAGT 

ATGTGAGAGTGTGGGTGTGTGTGTGAGAGTATGTGAGAATATATGTATGA 

GTGTGTGTGAGTATGAGTGTATGTGCGTGCCTGCATGTGTGTGTGTGTGT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGAAGTGGCCTTGGAAAACA 

GAG7TGTCAGATCTCTTAGAGATATAGTTGCAGTTGGTTGTGAGCCATCT 

CATATGAGCGCTGGAAGTTGAAATTGGGTTCTCTGGAATCCTCTGGGTTC 

CTTGTTGMGCCTGMTATTTTGATAAATATTTATGTCATTATCCCTCAA 

MTTGTAAATGTAGAATTTAACAAACTCAGGTCTTGAGTCATCTTTGTCC 

CMGGTTTGTTTGTTTGGTTTTTTGTTCCCCCACCTTTTCTTCAGTGCTT 

FIG.3D(29) 

SUBSTITUTE SHEET (RULE 26) 
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TTAAAAM6AGAGTCCATTTTTTCCTAMT6TTTAAATACAGTTGAGGAA 
TAGAACATCTGACTCCAATnCCTGGGTTTCCCTCCATGTAGT-GTAGTCC 
TGACCTGAmCAGTGTGCATTGAAMCTn-GATCACTTGGAAGGCAGCT 
ATGCTCACCACTATACTACCAATGTCTGCAATCCTATAGGAGAAACAACA 
ATATGAACTAACTAGTACCCCCCAGAGCTGTGTCTCTAGTTGCATATGTA 
GCAGAGGATGGCCTAGTCAGCCATCATTCGGA<k3A<3AGGCCCTT€GTATT 
GCGAAGATCATATGCCCCAGTACAGGGGAATGCCAGGACCA6GAAGCAAG 
AGTGGGTGGGTTGGGGAGCAGTGCGGGGG<3GGGGGGTATAGGG<5GTm-G 
GGGATAGCATTTGAAATGTAMTGAAGAAMTAACTMTAAAMTTGCCT 
TAAAAAAAAACAAAAMGAAAAG7TTTTGATCTTAGCTGACCAGTGTCTC 
TTTGGGTCTTAATTTCCAGCAAACCACAGTGCCAAGCAGTGCGGGACACC 
ATGTGCCCTGCGGACAGCGTGTGGCGAGTGCACTAGCAGCAGCTCGGAGT 
GCATGTGGTGCAGTAACATGAAGCAGTGTGTGGACTCCAATGCCTAGGTG 
GCCTCCTTCCCTTTTGGCCAGTGTATGGMTG^TATACGATGAGCAGCTG 
CCCACGTMGTG6AAGGAGCTTTTGMCATTT^A€GCM<3iTGG<^TTC 
ACmCTGCTCMGTCCATGCAGAAGCTCGTCG^OGGCGCTTCCAGAT 
TMCATGTATCTATAGMTGCAGCACAGTCTTCCATGCAGTAAATCAGTT 
ACATCMGGAGMGGCACAGGGTACAGAMTACCTTTTCTTCTTCAGGGT 
MTATTATMTTCMTCTGTATAATGTTTCTACATCTTAATCTACCAGTA 
TGTAMGTGCTTTCTAGTAGAG^GCTCC<XAGCTCCCTTTTTCATCCAAC 
ATCCTWATTAAM^iT<3GAAAA€TCCCT£TTATATATTATGTAAAAT 

FIG.3DO0) 

SUBSTITUTE SHEET-fRUUE 26) 

BNSDOCID: <WO 0005373A3 IA> 
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GTGGGGCCCTTTAAATTATTTCAGTTCAATAATCACTATAGGGTACTATT 

TTTAATTCATGGAAGTTAAATCATCTGTTAAAAGAAAAGGTAATAACAGT 

AMTTCAAATCTTGTGATAGTGAATTACAAGTTGGATTGTTTTGCCTTGT 

TTTTTAATAGCTGAAAATTGCTCTGGCTACTGTACCTGCAGCCATTGCTT 

GGAGCAGCCAGGCTGTGGTTGGTGTACTGATCCTAGeAATACTGGGAAAG 

GAAAATGTATTGAGGGCAGCTATAAAGGACCTGTGAAGATGCCGTCACAG 

GCCTCTGCAGGAAATGTGTATCCACAGCCCCTTCTGAACTCCAGCATGTG 

TCTAGAGGACAGCAGATACAACTGGTCTTTCATTCACTGTCCAGGTAA6A 

TGCCTGTGTATCCTAGTTCAAATCTCGTAGATAAACTAGACGCCCAGATC 

CCTTGGCTCACTTGTTTTCTTGACTGTGTTTGAGTTCTTTCTGTGTTCTG 

CATCACCTTGTTGGATCATAGCTGGCAAAGGTGCTCTCCTTTCTGTGGGC 

TTTTTCTTTACTTGATTGATTGTTTCTTTGGTTGCACAGMGCTTTTTAG 

CTTTCTGAAGTCCCAT7TGCCAGTTGTCCTTAATTCCTGGGCGAGTAGAA 

GCCTCATAAAAAAAAGTTCCTTCCTACACATGTATCATGTAGGGCACTGC 

CTATGTTTrATTCCAGMGTTTCAGAGGTTCGGGTrATGTCTTTGATCCA 

TTTAGGGTTACTTTiTGTGAAAGGTMTGGACACAGTTCTGTTTCATTCA 

TTATTCTACATGTGGACATCTACTTTTCCCAGCACCAGTTTTGAAGATGT 

TATCTTTTCTGCAGGTTGTTTGTTTGCTTGTTTTGTCTCTTCAGAAAATC 

CCAGATGGCGGTAGCTGTGAGTGCTTAGGCTTGGCCTAGCTGTTTCATTA 

TGiTGGCTTGCATGTCTGTTTTGTGCAGTGCCACCATATTGTCTTAATTG 

CTATAGCTCTGCMTCTATCTTGACATCTGTGTTGGCAATCCTGCAGTTT 

FIG.3D(31) 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID <WO 0005373A3 IA> 
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CGACCCTTCTGCTCAGCAGTGCTTTGGCCATCTG^GTCTTTTCTGGGTT 

CATAATGAATTTTAGGA I 1 1 I 1 1 1 1 1 CTATTTCTGAGAAAGTATTGTTGA 

TATTTTGATTGCGATTGMTTGMTCTGTAMTTGCTTTTGGTAGAATGG 

TCATTTTCACMTATTMTTTTACTGATCCATGAACATAGGATGACTCCA 

GTCTCTCATGTCTCCCTATAGCCCTGTCTTAAGAGATTPGGAGTmCAT 

TGTAGAAGTCCTTCACCTCCTTGGTTAAGTTTATTTCTAGATATTGTATT 

GTCTTTGGTATTATAAATGGTAGTATGTCCATGATCTTGTTCTCAGTGTT 

TTTTTAGTTTAG 1 1 1 1 1 1 1 1 MmATCTGTAT<^TGTTOTTTTATAT 

ATGTGTATATGTGCATTCATGTCCTCTGGGCATCAGATCCCCTCGGACTG 

GATTTACAGACAGCTTTGAGCTGCCTGTAGGT'GCTGAGAATTGAACCCAG 

GTCCTCTGCAAGAACAGCCAGTGCTCCTACTCCCCAGGGCCAGAAGTACT 

MTTT7TAAGAGCTGATTTTCTACCTTTGCTGACATTGTT<^TTGTTTCT 

AGMGTTTAGTGATAGAGTTTTTGAGATTTCTTATATATCTTATGTTATC 

TGTAAAMGGGATMmGACTCCTm(XTAmATATC^TmATTTC 

TTTCATTTGCCATATTGTTCTAGCTAGTGCTTCCGGCTCAGTATTGAAAA 

GAGTGGTGATTCTGMCAGCTTTrCTTATTTCTTATTTTMTG 

TCACCCATTTAAGATMTCTTCGTTATGBGTTTGTCATACACAGCCCTTC 

TTATATTGAGGTATGTTCCTTCCAGTCCT^^ 

TTTTTMTCMGAAAGCATATTGGGTTTTTTtaTTOT^^ 

GTTTTTCTAGACMtaGTTTCTCT^^^ 

CTCTGTAGAGCAt3GCT^CTT<^ 

FIG.3D(32) 

SUBSTITUTE SHEET ^RUbE 26) 
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CCCGAGTGCTGGGATTAAAGGCGTGCACCACCACTGCCTGGCACATGTTG 

GTOTTTTGCAAGCCCTTTCTACATCTACTAAGATGAGCATGTGGTTTCA 

TCTTTGTGTGTTTATATTGTCTGTTGTATTTATTGACTTATGTGTGTTGA 

GCCAACCTGAAGTTCTGGGATAAAACCCACATGCTTTGGATGAI 1 1 I IGT 

GCTATGTGCTTATATTGTGTTTGTTAGTGCTTTATTGAGGACGTCTGCAT 

CCGTGTTCATCTGGGGTACTGTCTGTAGT7TGCTTATTTTGTTGTCTTTA 

CCTGCTCTGCA7TTTAGAGTMTCCTGGATTTATAGAAAGCATTTGGGAG 

TAGTCCTTCTGTTTATTAAAAAAAAAAATTAAGAATGATTGGTTGTTGTG . 

TGGTGGAATTCTGCTGTGAACCCATCTGGTTCTGGACTCTATTCGGAAGG 

CTTTTTATTACTGTTTCAGTCTCCTTGTTTGCAGTGATCTATTTAGGTTG 

CTMTCTCCTTATGATTCATTTGGATGAATCAAGAAATTAATCCATCTCT 

TTAGATTTCCAGCTTMTGGMTATGAGTGTTAMGTATTTCTTTATAGC 

ATTCTGTA I I I 1 1 I GGCATCTGTTGTAATATTTCCCTGTTCTTTCTGTTA 

ATCTCTTTCTTTCTTGTGGTTAGTTGGGCTAAGAGGCTCTTGGI 1 1 II 1 1 

1 1 1 1 I I I I I I I ATCTTTTTAAAGGACCAGCTCTTAGATTCATTAATTCTT 

TGTATOTmCCTTGmCTTmCACTGATTTCATmAGATTTTATT 

ATTTCTTGCCATCTACTGCGTTTGGGTTGGTTTTAGTTA I I I HCCAAGA 

TTTTCAGTTTCATCACTMGTCATTCATTTGGGCTCTTTTGGGTTTCTTC 

ACGAGMCCCACTTGGGACTGTTACCTTCCCTTTTAGACCTGCTTTTAAT 

GTGCCCCAGAGATTTGTTACATTGTCTTTTCGATTTAACTTAGTTTCAGG 

MTATTTTGATTTCTTCTTTGACCCATTCATCATTCGGTAATGAGTTGTT 
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TMTCTCTAGTGAGTTTATACATTTATTA6AATTTTGTTTACTGATGATT 
TTMGGTGTTTGGCTTTGTTTGTTTOT 

GGGTTTCTCTGTTGTAGCTCTGGCTTTCTCTATGTAGAACAGTCTGACCT 

CAAATTCACAGAGATCCACCTGCCTCTGCCACTGAAGTTCTGGGATTAAA 

GGTGTGTGCCACCACTACCTCGCTGATTTTAAGTTTTATTACATAATAGA 

CAGGTAGGGTACATAGATTTTCTACATTT<^ 

CAGCATGTMTTCTGTCTGCTGCTGAGGGMTCTATGTTGTTTTGACAGT 

TAGGTGGAAAAGTCTGTAGACATCTGTTAGATCCATTTTACATTTCAAGA 

AGCCATTTMTTCTGMGTTTCTCTGCTTATT7TTTCCCAGGTGACTTAC 

CTATTGGAGAAAATAGGGTGCTAAAATCATTTACTATTA7TG I 1 1 1 1 1 1 1 

MGMGAAMTMTTAATTTAAAAAACCCTGGAAAGAAAGATACCAAATC 

TGMTCATGmCCTGGATAGTGG€GTTATAm<aATCAmATrT7TCC 

TCTCAMTACTGTGAGTTTTTACAAT<^TMCMCATAAATATTTTTAT 

GTTGCTGTGGACTTTMCTTTGCTTT^TMTATATTTCG 1 1 1 1 1 J GAGA 

CTMTrTCTTTTTGATATTTTATTTTC 

TGGTTTTGTTTTGTTTGTA I lilt I AGACT-GGCCACCAACTTGCTATGTT 
GTCMG^TGGCCTTAAAATCCACACCCMTACmGTCCTCTCTTTCTT 
TCTTTC i 1 M 1 1 II 1 1 1 I A TTCGAACAAAATTTCTAGGTGGGAATCTC AC 
TATGTTACCCAG^TGACCTGAMCTOTOG^nAAGCAAGATG^GTGC 
ACAT<^TCAGAGAC<3CTGCG^^^ 

TATA^CACAGACAGCTGTACTOACTCAmCMT^mAACAmAG 

FIG.3D(34) 

SUBSTITUTE SHEET^RUbE«26) 
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ACTATATGCAAATAAATATGAAATGTATTCACCAAGTTCTCCTATGGGAG 

AMCAGAGCCCTTMGATTTTTTCCTTTCAGCTTGCCAGTGCAACGGACA 

CAGCAAATGCATCAACCAGAGTATCTGTGAGAAGTGTGAGGACCTCACCA 

CGGGCAAGCACTGCGAGACCTGCATATCTGGCTTCTATGGTGACCCGACT 

AATGGAGGCAAATGTCAGCGTAAGTCACACAGGTCAAG7TAGTCACAAGT 

CAGGTACAATAGTACAGTACCTGCAGTTGACTTAAATATCTTAAAGGGAA 

MGGCCTCTTGGTTTGGGATATTGCCTTTCTTAATTATGTTAAATTGTTA 

AMGTTrAACTGAGGGGCTAGAAATGTGGCTCAGTrGGCTAAGAACACTG 

ACTGTTCTTCTAGAGGACCGAGGTTCAATTCCCAGCAGCCACATGGCAGC 

TCACAAGTGCTTGTAACACCTGGGATCCAACAACCTCATACAGACATACA 

TGCATGCAAAACACTAATATACATAAAATAAATCCATTAAAAAGTGTTTG 

ATGATGCTGGAAGAGGAAAAAAGGCTCAACTTGTGGGTTTGGGAGCAGTT 

AGTTAAAGCAACAAACCGACAGTAAAGGAGCTAAGCTTTTATTTCTTCAG 

CAGAGGCATAAACAAGGGGCCGAAGTCACTGAGGCACCAGCTGCCTTTAT 

TCCATTTCCCTCCCATGGAAGCACATCAGCTCAAGTCAAGCAGAGCAGCC 

TGGGATGGGAGGTCATCTCATTGGAGAAGGAGGCAGGAGGCATTGTGAGG 

GGAGGGAGGACAAGGCTGGGAATGGGAAGTCCTGAGCTCAGAATCAGAAT 

GAGGACMGATCTTCAGTTTCCTTCTTAATATAAAGAGGTATCACAGAGG 

TCTCTATAGAAGTCTACTGGAAGCCTCACACAGGCACAAGGGTACATTTG 

AAAAACTGTGACAGCCAGGGAGAGTCCCCTTCTGAAGTCTCCTTCCTCAG 

AGACTGCAGCACCTGACTGTGCCCCAGTCTfiCMGAGGTTTGGGGAGAGC 

FIG.3D(35) 
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MCTGACCTCCTGAGGACCCCAGAT6AATCTTrAAGAT43GC^T€CTTTPG 

GTmGGTTGGTTGGTTTTTAGACAGATCTAGGAGAGTT'GGTGATGAGCT 

TGAATTCTCTGTCCTCCTGCCTGACCTCCAAATGCCCAGCTTCACATGGG 

CTCCCATTMGTTGTGAGTTTGGGTGTCT^GCTCCTGCTCTCACAGCCAG 

TGCAGTACATT<^GCTCCATAGAGATAGCGCGGGGGCAAAT-GAGAGCTGG 

ACGGGCACTGGGTGACTCTGTCCCTTGTGCGGGAAAATCAACTAAACATG 

GGCAAAGGAGATCCTAAGAAGCCGAGAGGCAAAATGTCCTCATATGCACT 

CTTTGTGAAAACCTGCT<MGAGGAGCACMGAAGAAGCACCGGGATGCTT 

CTGTCMCiTCTCAGAGTTCTCCMGAAGTGCTCAGAGAGGTGGAAGACC 

ATGTCTGCTAMGAAAAGGGGAMTTTGAAGATATCGCAAAGGCTGACAA 

GGCTCGTTATGAAAGAGAAATGAAAACCTACATCCCCTGCCCCCAAACAG 

GAGAC C AAAAC GAAGTACTAGGACCCC AATGCACCC AATGCCTTCTTCGG 

CCTTCTTGTTCTGTTCTGAGTACCTCCGCAAAATCAAAGGTGAGCACCCA 

GCTTATCCATTGGTGATGTTGCAAAGAAACTAGGAGAGATGTGAACAAG-G 

CTGCAGCAGATGACMGCMCCCTA^GAGAAGAAGfiCTGCCAAGCTGAAG 

GAAAAGTAGGAGAAGGATATTGCTGGCTACAGAGCTAAAGGAAAACCTGA 

TGCAGCAAAAAAAAAAAAMGGGG<jGTC<^CAAGGCTGAAAAGAGCAAGA 

AAMGAAGGMGAGGMGATCGGAGGAGTATCA^GAAGACGAGGAAGAAG 

AMGATGMGMGMTAT^T^TGATGMTAAGCT^TTCTAGTTTTTT 

TCTCATCTATAAAGCAmAACCCtt^^ 

AGAAAAAAATTioAMTGTAAGtt^^ 
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AGTGTC i 1 1 1 1 1 1 1 GTATAATTAACATACTGCCGAATATGTCTTTAGATA 

GCCCTGTTCTGGTGGTATTTTCAATAGCCAGTAACCTTGCCTGGTACAGT 

CTGGGGGTTGTAMTTGGCATGGAMTTTAAAGCAGGTTCTTGTTGGTGC 

ACAGCATAAATTAGiTATATATGGGGACAGTAGTTTGGTTTTGGTTTTAT 

TTTTGGGI 1 1 1 1 I 1 1 1 1 I CATCTTCAGTCGCCTCTGATGCAGCTTATATG 

AATATGATTGTTGTTCTGTTAACTGAATACCACTCTGTAATTGAAAAAAA 

AAAATCGTGGCTGTCTTGACATCCTGAATG7TTCTAAGTAAATACAGTTT 

TGTTTTTATTAATATTGTC CTTTC GAC AGGTCTGAAAGTTTTCTTCTTGA 

GGGAAAGCAGTCTTTTGCTTTTGTCCCTTTTGGGTCACATGGGTTACTGC 

AGTGTGTATCTTTTCATATAGTTAGCTGGAAGAAAGCTTTTGTCCACACA 

CCCTGCATATTGTGGTAGGGGTAACACTTTCATCCATATTCAAAGAATCT 

CCAAAATCGTGATCAGTTGGATAAGAAATATTATATAACCTACTTGGCAA 

AGCAAGGTGTGATCAATTCTGTCACACCATGGGATCATTAGAATCAAGCA 

ATCTGAAAATCTGTCCTTAAAGGACTGATAGAAMGTATTTTCTAATCCT 

TATACAMGGCTCTCCTTTAACTGCCACTGCTATGTAATGACAGTTATGT 

TTTGCAGTTTCCCTACTAAAGAAGACCTGAGAATGTATCCCCAAAAGCGT 

GAGCCTAAACTACACAACTGCAGTACTATTTGTTGACCTTAGTCCCAGCG 

AAGGCTATCACGAGAATGCTAGCTATAATATAATGCCTCTGCCCCTCTAT 

CTAAATATGGATTCCTCAGGAMCTTGACTGCTTAAAGGTATTTTTTTCA 

TATTGTTGTTCCTCCTATAGGGTTGCAGACCCCiTTAGCTCCTTGGGTAC 

TCTCTCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATC 

FIG.3D(37) 
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tatctcttgtcagatttc 1 1 1 i 1 1 cttttctctttctttctttctttttt 
tm6atttatttattattatttctaagtacactgtagct6tcttca6atg 
caccagaagagg<3tctcagatctca™cggatggttgtgagccaccatg 
tggttgctgggctttgaactcaggaccttaggaagagcagtcggtgctct 
tmccactgagccatctctacmcccttamggtatttttaagtagttga 
gtcagcttttaamttatgccagaagtgtcaamgttcaaaagtttagga 
ccatcctctattgaagtacagggtcatcctgggctacatgagagcctgcc 

TTAAAACCAAAATGAAAGAAACAACAGGAAAAACAAGAGTTAAGAAAGAG 
AAAMGMGCACTTCGAMCAAAGATCTCTCGAGTATGTATAGGCTTCTC 
TACMCAGGTGTATGTAGGATCTTGATGGCTTTTGAGTCTATTACCCTCA 
AAGAGGTACTGAGAAACCTAAATGTGATCACGGTGGTCTCTGAGGGGCAC 
CTGGCAGGATTATGGGAGATAACTAAAGCTTGCTAATCACAGAGTTTAGG 
GAGGGAGGACGTCTCTMGGCMGTTAACTGTCTGGTTTGAGATGCTTAG 
GTGATGTCTGAGGMGTMTAAGGCCTGTCCATTTTCATACACACTCAGG 
CCTTMGTCTCGGTMTCGCTACTTGAACATAAAATAGTCCTCTATGAAA 
GGMTMTATCTCTGTGTCAGCAGCCTTCAC<3GCTAATGTTAATTjGTOCA 
GGAACCCTGCTTCTCAGTCAGACAGAAGCTCAATCAGGCAGGGGCAGGAC 

TTCTTTGCCTTTCCCATGTCCTTGTMTTT^ 
TCAAACATACTTACCTGTTAGGTAATTATAAGAACACCAAATATTACTGA 
ATAAMTGTGTTTAT<^CTTT<aTCGT^ 
CTTAGCCAGCMlKaAT<3GCACAGGC^^ 

FIG.3D(38) 
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GAGATAGGCAGATTTCTGAGTTCCAG6ACAGCCAGGGCTACACAGAGAAA 
CCCTGTCTCGAAAAAACAAAACAAACAAACAAAAAGATTTCGATGTCTTT 
ATCACCCAAATCAAGTAACTTTCCAAAGTCTCACAGTGAGATGTAGCCTA 
GTTGGGAGCCACATCTMTATATGCTGATGATCTTAACAAGTAGCCTGCT 
TGTGTCTTCAGGTGACCACCCCGGTGTCCTCAGCTACCTCTAGAAAGATC 
ACACTTTCCTCTGTGGTCTCTGCAGGGTCCCTGTATGATTGTGGAACCTT 
GCTGTACTTCTCAGAGTCCTGATTCATAAAGCACTGAGTTTTTGCTTGTT 
TGmGTmGATACTATrGGTAAGMTATATATTGAACCTTGACATGCC 
M l 1 1 AAMTMCATTATTTTTACAATAGTACTTTAGCCTTGATTATGTT 
AACTGCTTACTGTTTCAGATGACATTCGTACATCT77TAATCCTCAAACC 
AGTCCTATGAGATGGCTAGCATCATTGTCACATCATTTAGGCAAGGAAAC 
AGGTCTTGGGTTAAGCTTCATGCTCAGAGCTCCTrGGAAeACAGTGGACT 
CMGTGCAAGCAGACTGACGCGACTGGGTTTTACTAATTCAGTAAGCCTG 
TACTCTATGGAGGAAGAGTTTCTGACCACTGGATGCAGTCTGATGACCTC 
TGACTGTTCTGTTTGAMGGTiTCTTTCAGTGATTTrATiTTTCTC 
TGGACi mil I CCAGCTTTTAAAATATATATATATATCTTATTCGCTTC 
ACATCCTGCTCACTGTCCTCCCTCCCCTGTCATCCCCTCCTACAATCCTT 
CATATCCCCCCTTACCTTCTGAGCAGCTGGGAGCCCCTCTGGGTATCCCC 
ACACTCGGGCACATCAAGTCTGTGAGGCTGGACGCATCTTCCCCCACTGT 
GGCCAGACAAGGCAGCCCAACTAGAACATATCCCACAGACAGGCAACAGC 
TTTTAGGATAGCCCCT^TCCAGTTGTTCAGCACCCACATGAAGACCAAG 

FIG.3D(39) 
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CTGCACATCTGCTACATATGTGCAGGGAGQCCTAAGTTCAGCCCAT6TAT 

GTTCTTTGGTTTGTGGTTCAGTCTCTGAGAACCCCAAGGATACAAGTTAT 

CTGACTCTCTTAATCTTCCTATAGAGTTCCTATCTCCTCTGGGGCCCACG 

ATTGGTGTCCCTATTGCTTCACTGGGATTCCTGCCTGGCTACACCCACTA 

TGACCMGGCMGTCTTAGAAAAGACAACATTTAACTGGGGCTGGCTTAC 

AGGTTCAGAGG7TCAGTTCAGTATCATCAAGGCAGGAACATOGCATCATC 

CAAGCAGGCATAGTATAGAAAGAGCTGAGAGTTCTACAACTTATCTGAAG 

GCTGCTAGCAGMTACC<^CTTCCAGGCAGCTAGGATGGGGGTCTTCAGA 

CCCACACCCACAGTTCGTCTCCCTATTCCTTCACTCGGGTTCCTGCCTGG 

CTACAGGAGGTAGCCTCTTCAGGTTCCATATCCCCAATGCTGTGAGCCAC 

AGTTAAGGTCACCCACTATTGATTCTAGGGTGTCTCCCTCATCCCAGGTC 

TCTTTCATTGTGGAGATGCCCCCCACTTCCCCACCACTGTCAGTTGCAGA 

TTTCCATTCTCGGGACCATCTGGCCATGCCTTCTGTTTCTCCTCACACCT 

GATCCCGACAGCCCCGCCCATTCCTTCTCCTACCTAGTTCCCTCCCTCCA 

TATGCTTCCTATGACTATTTTATTCCCCCTTCTMGTGAGATTCAAGCAT 

CCTCACTTGGGCCGGCCTTCTTCTTTTGTTTCTTTCGGACTC^ 

AGCTTGGGTATCCCATTTTTTTATGGCTAATATCTCCTTATAAGTGAGTA 

CATACCATTCGTGTCCTTTTGGGATTGAGTTACCTCACTCAGGATGGTAT 

TCTTMGTTCTATTCATTTGGCTC£AAMW^ 

GTMCTGMTAGTAGTCCACTGTATAGATGTAGCACAGTTTCTTTAtCCA 

TOmAGTT^GTC^TCTAGGTTGmCCAGTnCTGX^TAmCM 
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ATAAA6CTGCTAT6AACATAGTGGAGCATGTGTCCTTGTGGGATGGTAGA 



GCATCTTTTGGGCATATGCCCAGGAGTGATGATATAGCTGAGTCTTGAAG 

TAGMCTATTCTTAGTTTTCTAAAAAACCACGAAATTGATTTCCAAAGTA 

GTTGTACAAATTTGCACTCCCTCTAACCAAGCAAGTGAAAGATCTGtATG 

ACAAGAACTACAAGTCCCTGAAGAAATAAAGTGAAGAAGATATCAAAAGA 

TGGAAAGATCTCCCATGATCGTGAATAGGTAGGATTAACAAGGTGAAACT 

GGACATCTTACCAAAAGCAATCTAGAGATTCAGTGCAATCCCCATCAAAA 

TTCCMCACMTTTTTCTGTAGACCTTGAAAGAGCAATTCTCAGTTTCAT 

ATAGGAAAACATAAAGCCCAGGAGAGCCAAAACAGTTCTGAGCCATAAAC 

GAACTTGTGGAGGAATCACCATCCCTGACCTTAAAGCCGCACTACAGAGC 

AGTCGTGATTAAAACAACAACAAAGGCTGCGCACTTTTGGTACAGAAACA 

GACGTGCTGACCAATGGCATCCAATCCAAGATCCAGAAAGAAACCCACAC 

ACTATAGi f i i i 1 1 1 lAAATATAAAGTTCTTCAGCTTAATGCTTCTCATT 

ATTCATGAGAGAAGAAGACTCAACAGCAAAGAAGGTGAAACAAGGGTGAC 

AAGTACCACAGGGCTCTCGAGTGTCTCTTGTGATGGACTAGGGAGCCCGT 

CAGTTCTGAATGCTCAGGAATGTGGTTCACAGTGTGGCCACAGTACAGAA 

GATCCCCGAGATAAGGCAGAAGACAGTCACCACAGGTCATCTCCACAGGG 

CAAGGACTCAGTATATGGCATATTACTAATGCTCTTAAATATTTACTGAA 

CAAAGGAACAAAATGCTGAGTCTGTCACAGAGATGAAAATAGCCGTTGCT 

TCAGGGGACAGCAGMGATAGCCTTTTTTTCTCCTTGAATGGTAGTTAAT 

TTAATGTTGCCTCTATATTATTAGAAATAAATTACAAGCTGAAAAATAAT 

FIG.3D(41) 
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GAGTCATAC6CAGT6ATTTCTCTT<XTTTAGGCTGTCTTTACTACAAACC 

CATTTCA6GCTAAATGATTTTGTCTTAATCACAGTCTAT6GTAATCTGTC 

AAGCCAGTTGTGACCTGTCTTCCTTTCCTTCTTCCCAGCATGCAAGTCCA 

ATGGGCACGCATCACTGTGCAACACCAACACGGGCAAGTGCTTCTGTAGC 

ACCAAAGGTGTCAAGGGGGACGAGTGCCAGCTGTGAGTACCACACACACT 

CTGTGTCTCCAGTGGGGGACTGGGCCTTGCAGCTGCCTGGGCCCTGTGGG 

CCACCTGCTTGCCTCGGCATTGTTGCCCTTCACTCCCAGGGTCTTTGAGT 

GGACTAGTGTGGAGGTTTACCTTTTTTCCTTCAGACAGGTTATCTCAGTT 

ACTTTAATATTGCTCTGATAAAACATATGACCAAGGCAACTTACAAAATA 

MGCCTTTAATTGGGCTTATGACTTAAGAGCATTGGAGTCTACATTGAGT 

TCCAGGGCAATAGAGCTACATAGTAAGACTGTATCAATCAATCAATAAAT 

AGGACTACATAGTAAGACTGTATCAATCAATCAGTAGATCAAGAGAAAGA 

MGAAAGAAAGAAAGAAAGAAAGAAAGAAAGGAAGGAAGGAAGGAAGGAA 

GGAAGGAAGGAAGGAAGGGGAAGAACAAAACAAGCTTAGATAGGAAGAAC 

AGGATAGMTGMTGACAMTGCTTGAAAMTCTTTTAGCTCTACTTTTA 

GAAGCATACTCAATCCACACAGAAGTAAAAATGTTGTTCCTTATCAGTAG 

TACCTAGCATTATTACATATGTACTTGCCTGTGTCCTT-GGGCAAGTATTT 

GTTTATTTGTTGTTTTTATACTGTT€CTGGTGTAAATTACT€AGCA<5TTA 

GCAGAMCATTCCTGCAMTGGGATAtnCTCTCTGATCTCAATAATGATA 

TAGTTTATGTAAMGGATTTACTT<3GTTTAAAMTAAATATAGAGTCT"GT 

^mAAATCTCMTAGMGATAAmOT 

FIG.3D(42) 
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GAAAATCGATACCAGGGAAACCCTCTCAAAGGAACATGCTACTGTAAGTT 

T1TGTMTTGTTTCTAGAGAGTMTTGMCAAMCGACATTGCTTTTTTT 

TTTTACCATTGTCTGAGAATGATAAATGCTTGGGGGATGAAGCAAATACT 

CATAGCCATGCCCCTGACTTGGTGAACACTGTTCTAACTGAGGCATGGTC 

TCTGCTGGTCATCCAGAGCAGTTAGCAGGGGTGCTGTCCTGCCTGTCCTT 

GTTCAGCTCCCGCGGAGGCGTGCTCATTCACCATTGCCCAGTGTAGCTTA 

TCATGTCCAATCTTCAGACAGCCAGGAAGGAGTTTCTAAGATAGAGGTGC 

GTTCCACCATTCTCTCTGCAGCTGATTTGTGCTCACAAACAAGTAAATAA 

AACACCAAATTAATACCTTGGTGTGAAAGTGAATCTGGTAAGCTTACAGC 

TTTATCATAMTATATTTTTTGTCTATGAGAATCTACATAGTAGGTTCTA 

GACTATAGAACAATAAAAAAAGGAATTAACATTTGGCATATGCAGCATAA 

TGGTATATATAAATTGTAGAAGAAAATGGATGG7TCTAGACCTGAAAAGA 

CMGAAMTTGCTTGTGTGTAATCTGGGGAGGTCTTAAGTTGTGACCTTC 

AACATCTGCTTCCCAAGCAGCTGGAACCACCAGGCCTACAGAATTCTTAG 

CTATGA7TCTAAAGGTCATTCATCAMTATMTGTTAATGTGTATTTTAT 

TAMGTTTCAAACTTCTATCTTTAATAATCTGCAAATGTAGCTCAGTAGA 

GGAGAGCTCTCGCTGTAAGGTCCTGTGTTCTATCCCCAGCACAACAAAAC 

AAGACA77TAAGAAAAAATTAAAACAAG7TGGCTGTATTGTCTCAGTATC 

TCATCCTTGAGATAGTGAG^CAGGAGGACTTTTAGTTTGAGGCCTATGTG 

GGTTATGTAGTGTGAMCCTTTCTCAAATMTATTTACACTTTTTTCTTT 

AAAAAACAACTTTTTTCTTM7TTATGTGTTTTGCAACATGTAAGTCTGT 
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GCMTGTGMCATATCTGTTGCCTTT-GAATGCCAGAGAGGiaTTTCAGTTT 
TCCTGGATCTGGAGTTACCAAGGGTTGTGAGCTGCCATAGTGGGTGCTGG 
TAATGAACTGAGTCCTCTGGAAGAGCAGCCAGTGCTCTTAACTGCTGAGC 
CATCTCTGCTGCTAGGTACTCCCCCTTCCCCCCTTAAATTTAAGACAAAG 
GTCTCACTGTGTAGCCTCAGAT-GGTCTAGAACTGAATTTGTAGAATGGTT 
GACCTTTGMCTCACAAAACTCTCCCTGCTTCTGCCTCCTGAGTGTTGAG 
ATTAAAGTTGTATGTCACCACACCTGCCCCTATGATTTCTATATTTAATA 
MGATCATGACTAGGATATAGAGAACACTTTTAGAACTGAAGAAGAAGAC 
AGTTACAGTTAAAAGCAAAACAAAAACAAAAACAAAACAAAACCCAGAAA 
AAAAMGMTGAAMCTAGCACT^GAAAAMTAMTTTTAAAAATAGG 

caaagagtcactattatattgtgatggatgtgttatatgtttaaaaccac 
mgtgagatacaggcctcaaatgactttaatcgaagctacaccaggctgg 
ggtggtagttcagttggtaaagttcttgctatgcaagcacaagaagctcg 
gtttgatgcccaggacccatgctgaaacccaggagt'gctgctgagtgctt 
cagctctggggtggcagggctcactggca6gaaggctaggctaagaga6a 
ctctgtctcgaaaaacaaggccgatggcacctgatgaacggcatctcagc 
atgacctttgctcggcatataatgtgtacacacaaattcatagtttagta 
gaagacmgtatgatctgcttttcatgmgtctcttgtmtacgccttct 
ttagttmccatagttgcttaaaaamgaaaaaaatggacctcactggac 
agaamtggatagactgttctaatagccaattcaattcatcatca™tc 
aaaacctatmcttaggg€gctggagagat<3gctcaggg^gtaagagcac 
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TGACTCCTCTTCTGAAGGTCCT6AG7TCAAATCCCAGCAACCAGATGGTG 

GCTCACAACCATCCATAAAGAGATCTGATGCCCTCTTCTGGAGTGTCTGA 

AGACAGCTACAGTGTACTTACATAAAATAAATAAATAAATCTTTAAAAAA 

AMCACCTATMCTTAAACTTATCAATAACTTTAACTTTCCTACCCCATG 

CTTCCTAGTTACCCATTCTGCTTTCTGTTTGTATGATCGTGGGTATGGCA 

TC7TMTGGAACCACAGTGTTTGACTTTGTATCTACTTAATATTAGGCAT 

GATGCCTCTGACTCTCATCCCTGATATAGCACAGTTCAAAATTGCCTTTC 

TTTGGTGCTGTACATATAGCTGAGCGTTTGAGTGCTTCCCTGCATGCACA 

GGTTTC TGAATTC AATC CC C AGC AC AAAAAATGATAAAAAGAAAGC AAAA 

AGGCTTATTTTTACAGCTGGACAGATCATCCTGCATTGTGCCTGTCATGT 

TTTGCTTGTTTCTTCTGTCAGTGGACACTGTGTTACTTCTACCTTTTGGT 

TGTTGTCAGGAATATTGTAAACATGAGTGAATATACACCCAGAAGTACAA 

CTGGATGTGGTMTTCTATGAGTGTTTTGTTTTTTGAGGGATGGTTATTA 

TTGTTTCCATACMTAMTTACATTTCCTTACAGTTCATTACATTTCCAA 

MGCCATGCATAGCATTTCTGTTGTTCTACATTCTTATTGACACCAGTTT 

TCMTTTACATTTATTTTGTGAGTTTTTTAATTGGTAACCATCATAATGG 

ACATAAAAMTAGCTCATTGTAGTTTTGGTATTTCTATTTCAGTAATCCT 

TGGTGTGATTATCTTTTTATATTCTTATTMCCATTAGTGTGTATCTTTT 

TTTGGAAAMCACCTCTTCMGGG7TTTACTATGTAGCTCTGGCTGGCCT 

GGAACTTGTGCAGACCAGGCTTGCCTCCGGTTCCCACTGTCTTAGGTAGG 

TTTCCATTGCTGTGAAGAGGCACCATGACCAGAGCAACTCTTACGAAGGA 

FIG.3D(45) 
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CATTTMTTGGGGCTCGCTTACA6TTTCAGAGGTTTAATCCATTATCATC 
ATGGCAGGMGCATCGCAGCATCCAGGCAGATGT^TGCTCGAGGAGCC<3 
AGAGAGTTCTATATCTTGATTCAAAAATAGCCAGGAAAAGACTGTCTACA 
GCAGGCAACCAGGAGGAGACTGTCTTCCATATTGGGCAGAACTTGAGCAC 
TAGGAGTGTTCCAAAGCCACCTAGACAGTCACACAGTACATCCAAAAAGG 
CCACACCTATTCCAACAAGGCCACACCTCCTAATAGTTCTACTTCTCATC 
GGCCAAGGATACTCAAACCACTACATCCACCTACTTCTGTCTCCGGAATG 
CTGGGATTAMGGCATATGTTCCCATTACCCMTTTTAAACCAGATTATT 
ATTGI I I I I I I GTACAAC AGACTTTTAAGGTTAAAGTTTGC AGC AATAGG 
CATTCTTTGMGCTCTATCACACTGATATATGTCTGTTGTTTTCTTGCTT 
CCTAGATTAAMTAGTACAGTATATTCAAGmCAATTGTCCCTTTCCAT 
AAGMGTCCTGGTTTCTGTTC€ATTATTA€TTTATATCTTAGTG1TCTTA 
AGTAAAMTACTCAGTATTTATAGATGAGTTAGATTA<aAfiCCAAACCCCA 
ATCAGGGTATTGGTMTCAAiaGTTO^ 

MGATCTCGTTTCTMTiaGAMGAACATCTMGTTG^GCATTACTGt^CC 

ACACATCTGTATTTCTTATTCTTTCGMCCTT€GGCAG^ATAGACAGATG 

AGCTMGATTCCTTCATAGCTATT<^TTTGTGAG^ 

TTCCAGAAACCTGCTTTAGTTTCTATC 

TGGTCTGTGTGATCTCCCTCTACC^ 

CATAGATACCCTTCTCATT^CTATCAGT^^ 

MGA<H^CGGCTACTACA€A<aC€ATCM 

FIG.3D(46) 
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GTMGCTTTTCTTTTMGCTGTCTTATTTrGTGTTAMTTTTGTATAGGT 
i 1 1 I I ICTTGGTCATCCTGGACAAAAGTACTACATAGAAGCAGACAGTAT 
CAGGGTGGGAATATAAMGGCMCCAGTTTTTMGTATTITTTTATTTAC 
TTGTTGACAGTTTTATATGATTATATAATGTGCTTGATGATATTCAACCT 
GTGACCTTTTGTCTCCCTCATACTTAGTTCCTTCTCTCCCCACCAAGTCA 
CCTTCACTCCCTCTCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAA 
AGACAGACAGACAGATAGACAGAGAGACAGAGATTGATTGATTGATTGAT 
TGATTGATTGATTGATTGATTTACCTACCTAGTTTACCAGCTGACTGCAG 
GAGCATGCTGGGTGGGAAGTTCTTACTGGAGCATAGACACATTAGAGTGA 
CTACACCACTGAAGAAAGTGACTCCCTCTCAGGTAGTCTTCACTGCCACT 
AGGTCCTCAGGGATCMGAGMTGTrrGGAGTCTACATTTTATCTTTTTT 
CCACTCAGAAGGCAMCATTACTGMTGTTTTTAAGTAGTAGAATAATGT 
TCATGATAGTCTGTTTMTATTAMTTAAGMTTTGTTCCTAATTATAAA 
ATTTTTAGAAGATAGACAAGMGACAAMT7T7TGAGTTMCAGTTTGAA 
AGGTTTATTrTTATTTTATTTTATATGTATGMTATTTTAGCTTCTTGTA 
TCCCTGTGCATCATGTGTGTGCAGTGCCTGTGGAGGCCAGAAATAGATAT 
TGGATCCCTGGAACTAGAGTGATAGATCATTGTGAGCCATCATATGGGTG 
CTAGAACCAACCCAGGGTCCTCTGCAAGAGCAGTGAGTGCTCTTAACTGC 
TAGGCCATTTCTTTA^CCCTAMTGT<^CAACTCTTTAMTAAATGTA 
AGTGATCTTAMTACTCTGGAGAAAMTCTGTAGCTATACCTTACTTTTT 

FIG.3D(47) 

SUBSTITUTE SHEET (RULE 26) 
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AAAMTTATTTTGTTTTATATTATGAGTGTTTTGCCTACATATATGTGT^G 
TCTGATGCCTGCAGAGGTCAGMGAGGGTGTTGGATCCCCTAGAAC'PGGG 
GTTACAGATGGCTGTGAGCAGCTATGTGGTGCCTGGGAGTT'GAACCCTGG 
TTCTCTGTTAGGGCMCMCTGCTTTTAACCATCAACCCATCTCTTT'GGC 
ACATGGGTGCATTGTTGGTTTGGCTGCTTGAGTTGTGTGTGAGGGGTGTG 
TGTGCATACATATGTGGGTCCATGCTTATCCAGTGGAGGCCAGAGGT-CAG 
AGTCATGTATCTCTCTGTTACTTTCTACCTTATGTTTTGGAAGCAAGATT 
AGATAGACCCCTGGGACCTTGCTGTCTTCTCCTCAGCACTAGGACTACAA 
GTCC AC ACCTGACTTTTTAC ATGGGGCTTCAGATCTAACTC AGTCCCAAC 
ACTTGTTTCATTTCCTTAGCACCTT^GCTAGATTCTTAGGATTTTAGAAG 
GAGCTTATAGCAAAATACCACAAGTGAAATTTACTACTGCCTTAGTCATA 
AGCAMTATTGMGGCTCAGTCTTTMGGGTATAATTGATAGTGTTCTTT 
I 1 1 I 1 1 I lAAGTAAACAAATAGCCTGTCAT'GGTAACTATG'GCKaTAGTCC 
CATTACTTGTGAGAGATGTCAGCTCAAGGCCAGCCTCCGCTACATAAGTA 
AGGGAAGACCAGCCTGAGCTATATGGGACTCTATCAAAACAAATAAACAT 
TGTAGAA I I 1 1 I GTAATACTTATTAGAAGGTAGCTGATGATGATGAGAGT 
CmAGACAmCTTCATTCCACTGTm^TCT«TGTGTCm<:AT^CA 
GATTTCTOCTAGAmATCTCmGTGT«TGT<3TCTGTGTCTCTCT13TT 
TTACAAMTGACAMGATnTAGTCCTTCTCGTGGAAAGTAGTTGCTAGT 
GGTCAGCAGATACTT^TAGTATAMTAAAT<aAGCATAGATCT43C^CTT*G 
CAAAGGAAGACAAAGGGAAAAMHaTTWCT^^ 

FIG.3D(48) 
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GTGAMGAMCTTCTCATTTGGAMTTACATTTTGAAAATAGGTATTGTG 

MTGTTTCCATTGTGGTTTGTGGTATMCTATCAMTMCACTTTTTTAA 

AMGAAAMTCTTMTTTTCTMGATTTTTAMTACCCTTTTAAAATGAG 

CAmCCAGCATGGmGATTAATTTGTAAAATGTAAGAATATAGTATCT 

AAGGCTACAGAAATGACTCAGTGGTTAAGAGCACTGGCTGCTTTACAGAG 

GACCCAGGTTCCATCCCCAGCACCCTCATGACAGTTCACAGCCATCTGTA 

mCTAGTTCCAGGGCATCTGATGCCCTTCTCTGATTTTCTCCAGTACTA 

GTGACACACAGCATACATTTGAACAAAACCACTGATACACATAAAATAAA 

TTGTT7TCAAGAAACAATATAGCATCTAATTAGCTTACAAAACTAA7TAT 

TTGTTTCTGTACTAATTACGTTTCTATTGGCATGACTAAGGCAACTTATA 

AGAGAAAGCATTTMTTTGGGGTTCACACTTCTAGTGCCTTAGATTCTAT 

GAGCATCATGGTAGGGAGTGTGGCAGTAGGCAGGCAGGCATGGTGCTGGA 

GCAGAGGCTGAGAGCTCACATTTGATTTTCTACTAGAAGACACAGAGAGA 

GCTMCTGGAAAAGGCATGGGCTTTTCAAACCTCAAAGCCCCCCTCTAGG 

AACACACACCTCCACCAAGGCCATACCTCCTAATCAAACAGTCCTACCAA 

CTGAGGACTAACCATTCAGAGATAGATGAGTCTATGGAGGCCATTGTCAT 

CCAMCCACCACAGGCCCCMGAMGATTTGTTAGTGAAATTTCAGTGAA 

AACTAAAACAGCATTAGAATTTACCTGGCATAGCCAGCAATGATCTCTTC 

TGTTCAGTGCCACAGATTTCTTTGAGTTAAAACTCAGTTGTTAAAACCAA 

AAATCAAMTGTMTTGGCACTTTAMTTGCTATAAGGGGAAACAAGGTT 

TTCAMGCCATGAMCCATATTCAGMTMTTTTAGCGAGAGAAATATTT 

FIG.3D(49) 

SUBSTITUTE SHEET (RULE 26) 
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TTTC I 1 1 I 1 1 I I GTCGTTTC I 1 1 I 1 1 I I CTTC6AGAGAAATATTTTTATT 
ATTTTATATTATTTTMTTACATATTTMTTATTAACCATTTCTGACAGA 
GGGCAAAAGGTGAGGATCTTCATGGAACTAATATCT6ATAAAGCACCAAA 
TTCTTCCCMCTCTGGGATGCAMTGACAGTTCMCTTCAGTTTATTGCT 
TGTATTGAAGAAAATTGACAAGAAATGTCATGTCTTAACATAAGCATGGA 
TTTCTTTTMGATGTAGMTAGTCTATMTTMTGTTTTTGAGACTAGTA 
AGACCTGATTATTGTTGTATCTTAAMTCTAGAAGGTACTAACAATTTTC 
TAATGTGTA I I 1 1 1 1 1 I I I CATCAGCAAAACAGGGATTTGGACATGTTCA 
TCAATGCCTCCAAAAACTTCAACCTCAACATCACCTGGGCCACCAGCTTC 
CCAGGTACAGACACACCTAGAGAGATGGATTGGCAAGTTTAGTGTAGGAG 
TTGGGGAAGGAGGCTCTGAAGGCTGGTGAGTGAGTTCAGAGCCCACCTCT 

gcctcttagtagccatg^acaccttgaacaagccatccttgaacaagcatg 

tacaattccctctctaccttaggctactcagagtgaggagtcacagctct 

tgcctccagcgttgctggmaggttggttgga^ 

gccaccaccttccagcactatgactatctctatgtttctgcttcacaggg 

gaaamctamgtgactcatagttttaagaaatgaaaactctttaaggga 

agggggatmctctmtatctaga^tattcatactttgggataactcct 

aaaagtacagcttttccattcttgtttatcttatagtgactataaaattc 

tgatggccctmtgtagcag™ctatamtaagcactc<:ataacttgat 

agccctgmgatagacctaggtngmtttagct^acggtgttgmcm 

gttactgmg^tttcttttctttgttttttaagt^ 
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GTGTGTTTGCCTTTGCCTGTATGTGTATAAGTGTACCATGTATGTGCAGT 

gcttgagaaggtcagaagaggacatcagctccccaccctcaacgag™c 

AGACAATTATGAACTACTATATCTGTGCTGGCAACAGAACCCAGGTCTTC 

TGAAAGAGCAACCAGTGCTCTTAACTGCTGAGCCATCTCTCTCTAGCCCC 

CMGTTACCTAMCTTTCTGATCCAGTTTCCTTCTTTATAAAATGATACA 

GTGAAMTAGCTTTGCTATGTACAGAGATATTCCMCTTTTTAATATTAC 

AACATGACATCTACAAATATGTTAGCCCTCATTCATAATCTTGCCTGAAT 

TGTAGAGTGTTGCMGGAATAAATGAAATAAAGGAGGTAC7TATTATAGA 

GTTTGAGG7TTGCCTTCATGCATAMGAGMGCTTTTTTGAGTCTGTACT 

ACTCATGTTCTTAGCCAATGGAGTATATAAAATATGGTAGAACCATTTAG 

AAATGGAGTCTCACTGGGTACAGGCCTGAATGCAGTGGTAGGAGGTAGCA 

GAAAGAAGGCCTGAGTGGCTGCTTGAGCACCTTCTCCATCAAGACTTGAG 

GACCTTTCTGCTTAGGAAGTGATGAGCGAGTAAGTGTCCCTGAACAGGAG 

CCTTGAGCATATTCTACAGTGTGAAGCAGAAATACAAAGGAGTTGAGGTA 

TCATGTGCAAMTGAATGCAGTGTCTGTTTTATATCTATGATTGTT7TAC 

ATACATGTATGTCTGTGCATCGCTTATATATCTGGAGCCTCTGAGACAGA 

TTACTTMTCTA7TGGGACTTGAGTTTTTCCAATCTGTAGATGGAGATAG 

GAAGGTGTTGTGTGGGTTAGAGACTGAAGCTCATAAGGCTATATTCTTTT 

GACACTGTMGTGCTCAATAMCTTTTACCCTCATTACTAGTGCGCAAAG 

ATTCTTTCTGATTGGCATACCCGCCTCCCMGTCTTTATTTTTATTCTTC 

CTTCTTTCTAGCCGGAACCCAGACTGGAGAAGAGGTGCCTGTTGTTTCAA 

FIG.3D(51) 

SUBSTITUTE SHEET (RULE 26) 
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AMCCMCATCMGGAATACAAAGATAGCTT€TCTAATGAGAAATTTGAT 
TTTC6CMCCATCCAMCATCACTTTCTTTCTTTATGTCAGTAATTTCAC 
TTGGCCCATCAAMTTCAGGTMGMCTGCTTTTTMCTTCATT42GCGTA 
MGATGGTGACATCTCTTTAGTG^GACTMCTTCACTCATTTGGAATCT 
GTGGTGACTGAMGATAGTGT?GCTTTGCCTTT^GGGATCTTTGCCATA 
GACTGAGTAGCAGGTGAGTGCTGTTCTTAGGTRaGAGAGATGTTCAGTGA 
GTGGAGTGCTTGCTACACAAGCCTGAGGACATGCAGTTCATCTGCAGCCT 
CTCATACAAAGCGGGACACGCAGGGTGTGCCTGTCACCTCAGCACT-GGAC 
ATGCAGTGTGTGCCTGTCACCGCAGCACAGGACAGGCAGGGTGTGCCTGT 
CACCTCAGCACTGGACATGCAGTGTGTGCCTGTCACCCCAGCACAGGACA 
CGCAGTGTGTGCCTGTCACCCCAGCACTGGACAGGCAGTGTGTGCCTGTC 
ACCCGAGCACTGGGAAGCAGGGGACAGAAAGATCTTGCTTGCTGGCCAGC 
CACTCAAAGCTGGATCTGTGAGTTCTAGATTCAGTTAGAGACCGTGTCTC 
MGTAAMTMGGTAGAGAGGAATTGAGGAAGACAGCT<aATTACCTGT^3G 
CTTCTGTATGCATGTGCACATATATATACCTTCACACATATACACACTCA 
GAGAAAAMTTCTGAGAGTGTCATATCACTTGfGAAGAMGTTTTAAAGC 
ACTTTTAAAAGCMGATGAAAGCTATCCAAGGTATGCAAGGTAGTATACT 
TTTGTMTCCCAGGATGTGGAAGACCMT^AGGAGGATCACCCTCAGTT 
TGAGGCCATAGGAAGACCCT€C^TCAAAAGGAG€GAAGGAG€GA6GGAGG 
GAGAGAGAGAAAGAGAAAGAGAAAGAGAGAGAGAAAGAGAAAGAGAAAAA 
GAAAGAAAGAAAtaAAAGAAGGMGGMGGAGAAAGAA^ 

FIG.3D(52) 
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GGCATATAGTTATGTCTTTATTTTTTGAGTAATTGCTATGTAAAAGCCTT 

TAGAAATACACAGTTTTAATTATGGAATTGAGTATAAATAAAACAAGTAC 

ATGTTTGTAACCAATAAAGTATAAAAATGACACATAAGATGTCAAAGTGG 

TATGATGGCTATAATGTGGAGTCCATAGAGGAAGCAGTAGGCAGTATGAG 

GTACTGTGTAAAAACACATAGCTTTACTATTGCACAGACAAGTGTGGATT 

CTTGTTCTGTGTGTGGTTCATGGAGGCTCTCCAGTTTGCAGATTCTCTGT 

GCATGTGTCCTGAAGGATTGGTCTTCCTGCTATGACCTCTGGTGTTATTA 

GCCTGAACTGAGTCCTAAGGAGACAGGTAGTGGAAATGTTTGTATreCAA 

AGACAGTATGGGTAGTTGTTTTTAGAAACAGGAGTTCAACAGAATTGATA 

GAACTTGTGATCAAGAAGCTAACAGCTGGACTGGGATGTAGCTCAGTTGA 

AAGAACGCTTGTCTAACATTAAGAAGCCCTGGGTACCATCACTACCACAG 

CATAAACTGAGAGTAGTGACAGACTCATGTGTCCCAGCACTGGGAAGGTA 

GAGGTAGGAGGATCAGAGGCTGCCCAGGGAGGTTGAGAGTGACTTACGCT 

AGGAGATAGATCTAAAAATGAAAAGGAAAAAGAACTTGGTAGCTGCTAGA 

GCTACCATGAAGAGAGTGGAGCTTAAGGATTCAGCTGAAGAATGTAAACT 

GCCTTCTGATGACAACTGAGAGTCGCTGAGTTATTTAAAGTCAGGAAGTG 

MCAAAGATCAGTGTTTCAGAAAGACCTCTGTGGCAACAGTATTGACTAG 

AAGTAGCCCCTCCTATCTCAGGTACTCGTTTAGACTGTATTTGGAAGTGT 

CCTCTTTCTTGATGGCCCTCAGACACCTTTGATGGCCACTCCTCTGCATT 

TGTACCCCATAiaCCACACACTTGAT^TTCTTTATTACATAAATAGCTCC 

TTATAGGCMTGATAGATTTTATATTTTTGATMTTTTAAGATAAACTCT 

FIG.3D(53) 

SUBSTITUTE SHEET (RULE 26) 
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ATGTCATTGCATAGAATTTAGTAGTTGTAGGTACTCAGTAAATGTATATA 
GGATGMTACAAMGCTTTAGGGTAACAGTAT7TTGTTCTTCTTCCCGCG 

catttttaactatctcatagtagcacagactaacccataactgaccatga 
agccaaggatgaccttgaactcctgtaccttctacctgttccccgaaagt 
gctgmg™ctggcatgtgctcctcacccmctmta<xmgtttttct 
tatamggtgctgatgccctttccctgtttgtgttaattgctgacactta 
amgctctttatcccmcccacagtgttamgagtttagttaaattttgt 
ggamttttgtccgamtgmgtcgttgat<^aggcctggtggctcctt 

CCTATAATTCCAACACTCAGGAGACAGAGTCAGGACGATGGCCAAGAATT 
CMGGCCTTGGGCCTACAGAGTAGAAGAGAGAAGAATGAGGATTGGAACA 
CCTGATTAAATAGATACCATTTCCTGCTACCAACCTGTGCCTTAGCTACT 
CTTCTATTGCCGT^CAAMCATCATACCCAA^GCAGCTTATAAAAGAAA 
GCATTTATTAGGACTCACAGTTTCAAGGGTTATACTCCAAAACCATCATG 
GCCGGGAGCAGGCAGCAGGCAGGAACATCTGCTGTGAGGAAGAGCTGAGA 
GCTCACTTCTTTATCCACAMTAGGAGGCAGAGAGAAAGCTAACTAGGAA 
TAGMTGAGCmGCAGACCTCAAAGCCCACCTCCTTCCCAAACATTTCC 
ACCAATTGGGAACTAAGTATTCTAATCTGTGAGCCTCTGGAGGCCCATTC 

ttatttamctaccacactttatmgttmtactacatgtgat^ggaaa 
ctggtatgggmtotgaamgtagttcacaggagtgggaggggctgaac 
gtgagtagatgctagcatgtgtgtcaggagt^aagtgtoagagcat^ 
ctggtttgacttctctccagagctgaggtgaacatcctttgt^ 
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AAACCCGTATTAAAGCGGTGGTAGTTACT6AAAATCAGTGCA6GGCTGTG 
GTCTCAACACAATGTTTGAAAAAGAAAACAGGGCATCCACATCAGGCAGT 
GTACAGCTGCTTATAATTCCAGTCCTCTGGCCTCTGCTCACATGCACATA 
CCCCCCCATACATACACACATGATTAAACATAATGAAAAATTAAAAATTA 
ATGCTATAAAAATGGAAAGAGCCGGGCGTGGTGGTGCATGCCTTTAATCC 
CAGCACTTGGGAGGCAGAGGCAGGCGGATTTCTGAGTTCGAGGCCAGCCT 
GATCTACAGAGTGAGTTCCAGTACAGCTAGGGCTACACAGAGAAACCCTG 
TCTCGAAAAACAAAAACAAAAACAAAAACAAAAAAAAAAGTGGAAAGAAA 
. GGTTCACTGTTTCACAGGAAAACTCTGAGAGGTGATAATCCAATCCCAGT 
TTAAAATATACTCCATAGTGCACACAGCCTCTCCCATCCTTGGCAACTGA 
GGCCTGTGAGAAGACTCAGTCCTCTCCTGGCTTCCAACCTTACAGTGTTC 
AAAACTCTTCTGCAAGATCCACATGGTCCTACCAAGACCCTGAAGGTCAG 
GCATGCTGATTAGGCTGTCTCTGGGCCTGAAGTGAAAGGTAAACACTTCC 
GAGATCTCCAMGCCTTGGGAAGATTCTGAAATGTATGGGTGTTGGTTCA 
GGTAGACTCTCAGCCTTGGTGAAGCTGCCCCCGGAGCTGTAGGGTTATCT 
GCAGAAAGTCAGCCAGGTGCACTTACCCTGGAATCCTCTCCCATTCACAG 
ACACCTCCCTGAGGCTTTGTGGCTTCACCTCACTGTGCAGCTAGCTCCTG 
TTTTACATGCTTATATAATGAATGGTCTTGGTAAAGAAGATGATAAAGGC 
MGCTAGAGGCCTTTTTTTTCCCCTCTTCAMTTTTGATTGGCCTTTCCC 
TACTGTTACACTGTCTACTCMGGTnTGAGCATTTACTTTGTGTACATA 
GTAAMGCAAAGTACATATTTTTAAGTAGAAAAGAAAGCATCTGTGGTCT 

FIG.3D(55) 
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TT6ATATAGGTGCTTTTCTTTATTTTAATAGTAATACTTATTCCATGCTT 

GTTMGAAATTCATTCACAGCGTGTTTTCATAGAGACTTTCTCTATAGAG 

ATATATAGAAATCTAGACATGAGGACAGCCCACTAACCCACTCTTCAGAC 

ACTAGCTGCTOTCTOGAGCCCTOGGCTCTCACCCTTOGAGGACAGCC 

ATCCTCACTCATATGTGACAAGCTTAGACACAGAATAATCACAGAGACTC 

CAGCCTCCCCCACAAACCCACAATGCCAATATGCCATATTCCCAGGAACT 

TTTAATAAGCCATCCACTCTAATACTCCATCTCTTATCTCAGGCATAGGC 

CCTGGTTTTGGTTTGCTTCAGAGTACTCCCTTTTCTCTACCAGGGCCTTC 

CGACTCTTTGCTGACCCTCCAGAGATGTCATTTCCAMTCAAG13GGGTTT 

TTGGTTCTGTGGGTGTTTFtnTTTTCAGT^^ 

AGGGGACGGAGCAGGCAAACCAGATCTCTAACTTCTGAGGCCTGTGAAGA 

GMGCATCAGMCCTCGCAGG<3GAGCT«GTAGGAGCAGGAGTCAGGeCTAG 

ATATGACTGTGAGAGAGTGGGGACCATTAC€AGTGTCTTACAAAT'GAGGG 

GMGGACTACCGTGCTGGGCCCTGAAAGATAAGGAGGACCAGGCTTCAGG 

MGGTAGGACACATTCTCCT^CTGTCTCGGATTGAGGACAGTAACACAA 

CTACTTAGACATACTTTGMTCAAGGACAGACTTAtaTCCTTCAGAACT^GT 

AMTCCATOTATCTTTCCCAAGTCTTAGGCTAGCCAAGTTTCTCAACAT 

TTATCTACCTCATCCCAMGIaGTTCCCAIaGACAMTATTTCTTACTCAAA 

CAmGATGGGAGTTGGMTCAGGTTGAGGAMT€CAGG<3GT<GTAGATTT 

TAGATTTCTGGGMTATGTATAGATAGCTAGCTTCTGT^GATAGAAAAT 

GAGATTGTMGTTTTT-CAGTG 1 1 11 11 i ACAC<3AGTTTGTGT3GCCCAT«GT 

FIG.3D(56) 
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ATGCACATCTGGAGGCCACGGGTCTACCTTAGGTGTCTTC7TCAGGAACC 

agccatcttatttttaagatgatctctctccagacctcagggctatcaac 

ACACCTCAGGGATCCATCCTCCTGACTGTATGTCCCTAGCATTTGGGTTA 

CTGTACCACCATGCTCAGGTCTTTGTGTAGGTCCTGGGGATCACAGTTAG 

GTTCTCATACTGCAGGGCAAGCACnTGTAAACAACTATCTCCCCTCCAT 

ATGGMGTATTACCACTAMTTACAACAAGATTTTCTTCTATTAAAATTA 

TATTTTAGAAGCTGGATATAGTAATGCGTTGGGGCAAAAGGAGGGAGGGA 

AATGAAGAGGATAGGAAGAGGGGGAGGGAGAAGGGAAAGAGTGGAGGCGG 

GATCAGAAGTCCAATGTTATTCAAGGGCAGCCTGACCTAGATAAATCCCT 

ATTAAAMGTmCAGTATAGAAACTTCTCATCACCTTCATTATCAGAAA 

AGCCCCTAAATTCAGAACACTTTTTMTCTTMTTAGTTGACAATTTCAT 

AMTGTATTATTrATATATATGAATMCATTTTCCTCCTACCTTTrTTTC 

CCTTCCCCTCTGATGATrCCCATCCTCCCAACCAAGCCCCCCTTCTGCAT 

^GTTTGTTGCTTTAATGACCCACTGAGTTCCATTGGGCTCACTTCCATG 

AGTGTGACTAGAAGAGCTATTTATCAGAATGTGGGCAACTTACCAGTAGT 

GACACTGATGMGAMGTGTTTCCCTCTTACCCAGTAACCATTAATGGCC 

AGGAGCTCCTGGGAGGGGTGGGCGCCTTATGAGCCCCTTCTCCAAAATGC 

TTTCAMCTGTGACCAGCTATATTTMTGTTTTTATTATCCCTGTGTATC 

CATGTGGGACAAGAAAGCTTGAGAGTATCATAGCATGCATGTGGAGGTCA 

AAGAACAACTGTGTAAAGTCAGATCTCACTTCCCACCTTCACATGGGCTC 

TGGCACTGAACTCATGTCAGTGACCTGAGAGGCACTTTATCCTCTAACAC 

FIG.3D(57) 

SUBSTITUTE SHEET (RULE 26) 
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GCACCCTGTGCCCAGCCTAAMmGACCTTTGCAAGGTTTAGTGTGTGT 
TATCTGACTGTCTGAGTAAGGATGACAAAATGAAACCAAACTTATGGGAT 
AMGCTTGGTGGTrGTATCAGTACATTmATOTTGTGATAAAACATTA 
TGACCMGACAGCTTATAGMGAGTTTATTT^GTCTATAGTTCCAGAGA 
GGTAAGAGTCTGTCCTGACAAGGAAGCT*GTGGCAGCAAGTGGCAGGTATG 
GCTACAGGAGCAGGMGCAGAMGAGCAMCTAGAMCAGTTtaAGGTTTT 
TTAATAGGAAAGCCCACTCCCCTAATGATGTCCTTCCCCTAGCAGACCAC 
AAGTCCTAACCCTCCCTACACAGCACCACCAGCTGGGGAGTTCAAATGTC 
TGGGACTGCAGG6GACATCTCATTCAGACCACCTCAGTGGGAGAATGCTT 
GCCTTCATAGTATGTGCAAGGCCCTAGGTTCAATTCTAGCCAAGAAAAGA 
GMCATGAGGAAAGAAMGMGGTGGGAGAGAGTAGAGAAAGAAGAGAAG 
AAGAGGAAAAAGGAAGGGAAGGGGGAGACAGAGGAAAGCAGGGAAGCAGA 
GGAGAGGAGAAGAGAAAGAAAAGATTAACCAGCCTGG 1 1 1 1 1 AATAGCAC 
CCCTCCCACTCTCAGTAGTTCCCMTTTGAGCATTAAGTTCAAGACTGAT 
AGATATTTCTGGGTGGGTGACCAGTGTGGTCATAMCATjSGT^CTTTTG 

ctctccgtacmcttgtga™t<^cttcttagatgatcagcttcaaca 

ggagagggcctcctttagtctcafigt^cccctccagccaccctgggact 

cgcagcctctctctgatga€acaca<^cattaact<3gtat^ttct<3gt 

ttgccaamcgtcagtccat<^tt<^ctctgcacmt<aagaaagaagct 

ttgagmtcattacat<&catca^ 

aaagggccaggagcatcggcaggttt^ggcagccagtactaactagtaaa 

FIG.3D(58) 
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AGCACCTCATAGGTTTCTTTAAAATGCAAACACTAAGGAAAATCTAACTT 
1 1 1 I 1 1 ATTTATTMGGCCATTCATTTTATTTTATAAGTATTTTGCCTGT 
ATACATATGTACCACATGCATACAAGGTCAAAAGATAGTATTGGGTCTTC 
GAACTGGAGGTACAGATGATTGTGAGCTGCCATGTGGATCCTCGAAATTG 
AACCTAGGTCGTCTACAAGAGCAGGAAGTGCTCTTAACTTCTGAGCCATC 
TCTCCAGCTCCAGAAAAGCTACTCATAAAAGTGAAATCTAAGCCATGTGT 
CTGGTGATGTACACCTTTAATTGTAGCACATGGAAGGCGGAAGTAGGC^G 
ATTGTTATTCATCCAAGGCCAGTCTTCTCTTAACAGTGACAAAAACAAAA 
CCAAACCCGAAACCTGTTACTTTGCACTTTAGAGTATAAGTGATAGAGAA 
AAGACACAGAMTTTTAGAATCTATACCTTAAAATACCTTATGGCTTATA 
TGATACTGTTGGGACCATATTTACTTATGGAATGCAAAAAAAAAAAAAAA 
AAAAAAAGATGGGGGGGGAGCTGAAGGTCTCCTTTCTATTCTGTTGTAAA 
TCTAGCTATAAAAAGAGTAAGAGGCATGAGTGTGTCTCAGTGGTAGAGCA 
CCTGCTTAGCTTGTGTGGGATTGAATGATCCTCAGGACCACAGAAGAAGG 
GTGGGGCMTAAATTTAGGAAAATAAGATGCTAATCATTGACTTTCTTGA 
mill I AAAAAMGTTATTATTTTATGTTTATTGTATATGTTTATATTT 
TCTATGTGTGTTTTTGATGTGTGCTGGAGGGATGGGGGCCACTTGCTGAA 
CTTCCCAATTG7TATCATAACTACCATCTTTAGTGAAACAG7TACCATCT 
ACTTAGTMTTGTTTCATTCGAATAGATACTGAACACTCTTAATCTGAAA 
CTAATGCTCAGAAAGTTCCACTTTGCCAAGCAAGCAGGATAATGTAAGCC 
TATMTTTTAGCACTGGGAGGGTGAGGCAGAATTGTGAGCTCAAGGGCAC 

FIG.3D(59) 
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CCTGAGCTTTTGAGATCCTGTCTCAAATAAAATTAAATTATATAGATATC 

AGATTTTCA6AATAGGTGTGTTCAGCTGCTGAATAAATCTAAGCAAATAT 

CCCCAAAGMCCCTGAAATCTGAAACGTATTAGTTCTAAGCCCTATGTTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 

TMGATAGATTCTCACTATGTMCCCTAGCTTGCCTGGATCTTGCTATAT 

AGAGAGACCAGGCATATGCTATCGTCCCTCGG^TOCAAAGGTTTTAGA 

TGAMGATTTCAGTTGTACCATTATCTTCCTAATGAGGGCTCTGGTCTAG 

TGAGGCAGGTGACATTAtaGCCAGTAGTMGTATTAGGAATTGGTGATGAC 

GGTCAATTCTGAGACACACAGTAGATACATCTAATCTACCAATACAACCA 

ATGATTTAGAMGAATTAGGCCATAGTTAMTTTK^ 

CCACAAMTMTGTTACTTCTTTCAGTTCTTAGTTCAAATACAGTAGGAA 

1 1 1 1 I lA TATTCTTGGTGCTAMCACTATTATTTTATAGTAAAGTTAGTA 

AGATAGAMTGACGCCCT<aTGGGTTGTCTO1aTCGTA<3TCTCTAGCTGAGG 

CCATTTTGCTGAGAAGCAGCGTAG^CTGTCACTGG^TTTCTCACCCATAT 

TTTCTGTATTTTTGCTCCA<aATT43CCTTCTCCCAGCACAGCMCTTCATG 

GACCTGGTACAGTTCmGTGACTTTCTTCAGGTAATTTCTCTATGCTAA 

TTGTACACATOCATCGAGACAGTCCCTTMCTGCAGCTTGCTTTGTATA 

TCCCTACAMGCTGCTTTOACTCACAGT^^ 

GATAAMCTCTCCGm43TATGATTeG13CTCm^AT<3GG^<^6Gm 

GGGCTCAA<3CAGTTATTAATAATATAGCTACTCCTGTGAGCTACATGTCT 

TMTCTCTCTTAATCAAGATAT^^^ 

FIG.3D(60) 
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^TTTACTTGCAMCTCCTGGGGTTCTCCTTTTTTTATAGTTTTCTTATT 

AGTAGGG I I I 1 1 1 I I I I I I 1 1 GAGAATACTATGCAGAAATGATTGAAAAG 

AACAAATTAGTCATTGCATATTGGTAAGAGAAGCAGCAAGAGCCACCTCA 

CCTCCCTCTGCTCTCCCCAAATAGAAACTGCTCTGCTGTGCTGCTTCTCT 

ACCTTCACACCAATGCTCGGCCTGCGAACTCAGTTATCTTTCCTTTCCTT 

TTAAGATAGGGTCTCTCCTTATAGTAGTTATGACTGTCCTGGAATTCTAA 

ATAGAAGAGGTTGGCTTTCAAATCACAGATCCTCCTGCCTCTGCCTTCTG 

AGTACTGGAAGTATGGTGTATGGCACCGTGCCACAGCTAACTCAGTTATT 

TTTTGGTGTTCTATAACTGCCTTACATACATACAGACCAGGTACACACAA 

AATTC C TTTC C ATTAATTTAATAGTTATATC AC AATGC ATTGACC AACTA 

AAAAATCCTAMTTGACTTATGATTCTACTTGCTCATGTTTTAAAGGAAA 

GGTTACTCTTTGCTTATCTTAAATGTMTATTTTTCCTTTGCAGTTGCTG 

TTTAMTTTTCCCTATAAGTCGACCCCAAATTTACATCTATAATCTGGCA 

AAACAAAAAGACCTCTAGTGATGGTTGTCTCTTAGCTTTAGTCTCTCTTG 

GACTCCATTCCCTCCACCCATAATGTTCCATCCTCTGTCCTTAAGTGTAC 

TAGTCTCCMGGCCTGCTATGTGGTTGTCATTGTreTAGTTACTTTTCTA 

TGTTGTGACAMGCACCCTGACAGTGGCMTTTAGAAAGCATATAATTTG 

AGGATCACAGTTCCTGGTTAGAATCCATGACCATCTTAGCAAAGGCAGAC 

AGGCAGGCCTGGCACTGAACAAGTAGCTGAGATCGTCCATCTGGTCCACA 

AGCATAAGGCAGAGAAGCTAATTGGGAATGGCATGGGCTTTGGAAACCTC 

AGAGTCCACTCTTAGTGATACCTCCTTATCCTTCCAAACAGTATTACACA 

FIG.3D(61) 

SUBSTITUTE SHEET (RULE 26) 
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TTCAAACTTCAAATGTGTGAGCCTCTGGGGACCACTCTCATTTAAACCAC 

CACAGTGATCTTGGCMCTTCTTTTGTGTTCGTCCCATGCCACAGTCTTT 

CCATGTATTTCTCCTTTTGCTGGMCTTTTTCCCTCGAAG^TTCCTGAtaG 

AMGAAACATAGATAACTTTTGTATGTACTTCTACAACTGAAAGTATCTT 

MTTTTTGCCCTMCAMTTTTTGTTTGCTTACTTGCTTGCTTACTT^ 

TCTGCGTGCATGCATTTATTTGTTTGTTTGTTTGTTTCTTTGTTTCAGAC 

AAGATCTCTCTTTGTAGTTCTGGCTCCCTCAAACTCAGAGAGATTCATCT 

GCCTCTGCCTCCAGAATGCTGGGATAAAGGCATGCTCCACCATACCTAAT 

CCMCCTCACMTTTTTTMGTGT€TATTTATATGTGTGTGTGGTATATG 

TAAAGGTGTGTGTGTTCATGCACACATGTGCAGAGATCAGAGGAGTCAGG 

TTTTCTCATCTATCACTCTCTGCCTTATTATTTTGAGACAtSGGTCTCTT'G 

TTCGATATTACATATACTAGGTGAGATAGCCCAGGAGCTTCTAGGMTTC 

TCTCCCATTTCTACCTTCCAMTGTCTGCTACTCCATCTGtaCTTTAAGCA 

AGTTCTGGGAATCTGAGGTCAGGTCCTTACACCTATGTAGCAACTCTGCC 

TACTGAGTCATCTTACTAGTATTCACAAGGTCAAAGGTTGGGACCAACAG 

CCAAGGTTGTCCTCAGATCTCCACACAGAWAGCCACAATTATACAAAC 

ACTCMCATAAACCTATTTACACAC€CACATCACA€-GCACACACATACAT 

GCACATACAAAAAMTGCTTTTTGAAAGAAGTAGAGMT^TA^TAT^ 

TATTACACGTATATMTCCMGCCACTCT<3GAAGCTGAGGCAGGAGGATT 

TCMGTTTGAMCCACCTTGACCACATMTTATACCATCCCTCAAAAATT 

GWA<3AGMTAA<^TGMTAT<Wtf^ 
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ACTTTTCTATTGCTGTGGCAAAACACCATGACAAA6GTAATTTACAGAAG 
AGATTATTGGGGCATATAGTTTCAGAGGGTGAGTCCATGACAATTATGAT 
ATGGCACTGAAGTAATAGCTGAGAGCTTAAATCTGGTCCACAACATTAGG 
CAGACAGAGAGCTAACTGGAAATAGCCATGAGATTTTGAAACCTCAAGCC 
CCACTCCTAGTGATGTCCCACACCTCCTAATCCTTCCCAAACAGTTCCAT 
CAGCTGGGAACAAGATATTCAACATATAAGCCTATGGGGGTCATTCTCAT 
TCAAACCACCAGTAGTAATTATTAGAGCCCAGCAAAGAAGGAAGGGATAG 
AAAGAAATGATTGATGGGAAC TGGGGTGAAGTC TGATAC AGAGAGATCTT 

TATGTACTGCAGCGTAGCTCAGGAAGATAACTATGGTTAAGGACAATTAG 
CTMGTGATTAGTAGAGAGGATTTTAATATTTCCAATACAAAGAAATGCT 
GCAGGCCTGAAATAGGGTACGTTTCAGTGACCCAGATCTGATTATTACAA 
CTCATACACTTGTACCAACCACATAAATATGTACAATAATTGTGTCAGTT 
TTATATTAAATAAAMTGTGGAGCMGTTAAAAAATGCCTGTTTTAMCT 
GATCACAGTTATATGCCAGCTTTTCTTTGCTGTGACAAAATACCATAGGG 
AGTAGTTTATMGGAMGAGATTTCCTCCAGCTCATAATTCCAGAATTTT 
CAGTCTAGAGTCAGTTAGTTCTATCATATTGGGCCCACAGCTAGACCAAA 
TACAATGATGGGGAGAATGTGGTAAAGAAAAGTATTTACCTCAGAGTGGT 
CAGGAGGAACACAAGACAAAATATACATTTCAGTCCCATACCTCCAGTGA 
CTTGCTTCATCCAAACAGACGCCACCATCCAATAGCCATTAAAATAGAAG 
TCAACCAGTTGATTGACATCCATTGATCTTAGTCATATCCCTAAATTCAA 
CCTCTAA€CTCTCATGCTCTGGGGGCCAAGCCTCTATTGCATAAATCTCT 

FIG.3D(63) 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 000S373A3 IA> 
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GGAGCATATTTCATAATATGAAATATTAAACAGGTCTCTCAGGAGCTGTT 
TGGTAGACTTAGTTG 1 1 1 1 I 1 1 1 I 1 1 I 1 1 I HaTTTAAGG I Mill JGGTT 
GGGTTTTGTTTTGTTTTGTTTTGTTT^ 

TTCGAGACCGGGTTTCTCTGTATAGCCCTGGCGGTCCTGGAACTCACTTG 

TAGACCAGGCTGGCCTCGGACTCAGAAATTTACCTGCCTCCTCCTCCCAA 

GTGCTGGGATTAMGGTGTGCGACACCACT^CTCGCCTAGACTTATTTT 

TTTMTCAGATTTGAGTCTTTGCCTCTGGMTCACAGTA<3CTTTTCCCAT 

TCAACACCTAGTTTACAGAAGAAAGAAAACCCAA 1 1 1 1 1 1 11 I 1 1 ATAAT 

CATTAGACMCTAGMGTTTTCCCTCCTATTAAGAAAACATATTAACGGG 

CTGGCGAGATGGCTCAGTGGGTAAGAGCACCCGACTGCTCTTCCCAAGGT 

CCAGAGTTCAAATCCCAGCAACCACATGGTGGCTCACAACCATCCGTAAC 

GAGATCTGACTCCCTCTTCTGGAGTCTCTGAAGACAGCTACAGTGTACTT 

ACATATMTCMTAMTAAATCTTTTTAAAAAAAAAAAAAAAGAAAAAGA 

AAMGAAMCATATTMCAGTATTGAGAAMCTGTTGGCTTAMTTTGAT 

GATTTGMTTTTATTTTACTMTAMTCCATGT^^ 

CACATCCCAtSCACTCAGGATTGCGAGATAAGAGATCATAAGTGCAeGCTA 

GCTGGMTAGCAAMTAAMTCTTTTTTTAAAAAATATACATACATACAT 

ACATACATACATACATACATACATACACACACACACACACTTTTCTCAGT 

AGTACGGCCMiTAGTT^CTTGTCTAAeGGAGGXaAG^AA^GGAG^Ata 

AGAGCATCCTGmAGATCA03mTCTm<3CATTCAGTCT^6GACC<X 

AGCC<:ATAGTGT<3GT<SCT^^ 
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AGTGTAGAAACTCTCTCAGAGACATGCCCAGATGCTTGCCTCATAACCAC 

TGTGTATGTATATATGCTTACAGAAAATATACTCATCATTACACATAAAT 

TTCATCCACTTACCTCTTATGAAMGTTGATTATTTACTGAGATTTTTCT 

CATTCTGAAMTCCATAMGTCTACCACATTGATTAMTTACTTGTTTTT 

TACCTGTTATTGCTCATGTTAGMTTGCTTTCCTTATTTGGGGTAAGCTG 

TCGTTGGCCACTGTGAGGGGCTTATCAAGAAGTCAGAAATGGGAACACCT 

TCTAGGAAGTCAGGACTGGAAGCTTAGCTGAGCCAGCAAGTGTTTCTCAC 

ACTGCACTTCCTGTGAGCCTACCTGTGCGGCATCAGGAACTGGAGTTGGG 

ACCTTGAGGATTGTTCCCTGGAGGCAGGGGTGGAGTCAGGCAGGGGTGAA 

GCTGACTCACMGATGGTCTTGCCTTTCAGTTGTTTCCTCTCGCTGCTTC 

TGGTGGCTGCAGTGGTCTGGAAGATCAAGCAGAGCTGTTGGGCATCCAGG 

CGGAGAGAGGTMGCCCMGTAGACAMCTCCACATAAAACTCATTITrT 

TCCTTCTTTCTAGGCAGATCACTTTTACCTGTTGAGTGATGACTAATATT 

CATATGAGAAGCATGCTGTTTAACCTGCATTCTGTGGTTCCACTATGTGC 

CATCAGTAGATTTTMTTATTCTTGCATAMGTGTCATTAGTTTTGCCAC 

TGCTTGATTCAAGTCTTCCTAAGAGTCTTTCCTAAGAATATGAGTGTAGA 

GACAAGTTCAGCTCAGTGACAGAGCACTTGCCTGGCATAAACTGAGTCCC 

TGGATTCTAGTCTCAGCACCCTCTAATAGCACAACACTAGAGACAAAGCT 

TCTAACCTGTGGGTCTTGGGCAGCAGGTAGGGGAGGGGGATTTAAAAAAC 

AAAAACAAACCTCTAGCTGTAGCCTGTGTCA7TTGTTATGACTAAGCACT 

AGAGTGGGTACTAGTAGACATGCCATGTGGACATTGAGCACCTCTCCATC 

FIG.3D(65) 

SUBSTITUTE SHEET (RULE 26) 
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CCAGGCACTGATCCAGGTGGTTCTGCTTTATCTTCATCTCCACCCTAGGA 

TATAAGGGAGGCTACGTAACTACCCATCACCACACAGATGCTGAGGTACA 

GAACTGAGGGGTAACTAGTGCCTCTGCCTTCACAGCACAGGTTCCTAAAC 

ACGTTTrCTACAMCACTTCATTTGTTCTAGTCTGTTCATTTAAGAATCT 

CATGTTCTGACTGAATGAGCTAGACAACTCACCCTAGACTATACATTCTA 

MGMGGGCMCMGGCAGTTTTGTTACTGTTGAGAAGAAAACAAAGTTA 

TTTC C GTATGAGTTATTGAGATAGAAT AGT AGAGATTTGTC TGAATAC AA 

AATAGAAAGTATATAAAAGTATATAAGTGGATCATAAAGAAAGCAACAAT 

CAACTGGAAAATATTTGCAGTATCATGAGAGAGAGAAAACTAGAAGAT-GA 

ACCCCCTCAAAAMGGAT7TTTAAAATATGCTTAGACTGTATTCAGTCAG 

CTAATAAAC 1 1 I 1 1 1 I ACCTTTATTTGGAATTTACGAATAGCACTGAACC 

TGACCATTGTAAATGCACGAGGTCAGGCATGACTTGTTCCCAGTAGGAAG 

TTGTTTTTAGTTCTTGCTGTGGCCTGGGTCCTGATGGAAGTTCTTTACCC 

ACCTTATCTCCTGTCCTCTTGGCAGAGGTTCTAGAATAGTGCTGTGAT6G 

GGTAGCAACTGTCTTCCTGTGACCCTGCACCTAGATTATTACAGAAGCCA 

GACTGGGTTTGCTGAGTTAATGGAAATTCTTTCTAGGTTCAGTAGAGAGA 

TGTGCTGACACATACTAGG«:ATCTAGTTTTTCAGTAAT<BCTCAGAGACC 

GCMTAGGATATGTMCAGCMCAAAATTTTTAACATAAAATTTCCCTTC 

TAAMCAGAGTGATGATTTATGTAGCTTCAGGATGCT^CTCCTAGAAGA 

TGGTTTGMGCAA^GCCAGTrrGTCTTCCCTAGCATAACCTCAGAAGACC 

TCTCATATTATTCAT<3GTATAGGAA^ 

FIG.3D(66) 
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ATGTGTGCTATAGTATCTCATCTGACCCAACATGAAAACATTTCAAGCCA 

TGTGTGCTTGGGTAAGGTAGGAGTTCAAAGTCATCCAGTGAGTTCAAGGC 

CAGCCTGGGCTGCATGAGACACTGTCTCATAAACAGACACTTGAATCTCA 

TTTAAAGAAGACATTGAAGACTTGATACTTTGAACACCTATCCTAACGTA 

TCCACCCCCAAATCCAGAGTCCTTCATGTTCTTGTCCTCTGCAGTTCCAC 

TTTCATTGTGTTCTCAGCAGCAGCTCTCTCCGAGGAGAGTTGTCTCCCAT 

CCTATCAGCCATCTTTTTTATTGTTGTTGCTCTGACAATGTCTGGTTCAG 

GTTTTMCACAAAGCMGCTAGAGTGATTTTAATCTAGCAACAAAAATAT 

AAAAAGGTMGTTTTTGCCCTTTTATATATTCAATCAACAGATATCATAG 

CATTATATCCTCCACTTTMCTTTTATTTCTTACTGGTMGGGCTTTTTA 

TAAAAATATAATAGTGTTACCACATGTAACAAAATTTGATACCTTGTGCT 

ACCTAGCACCTTGTCATGTCCAGTTTTCCTCAGCTGTCACAGAAGCGACA 

CTGCATCTGATCAGTTTGAATCAGAGAGAGTGTAGCATGTCTAATATCTA 

GTATTCACTAATAAAATCTCAGTACTAAGCATATTAATAATACTATATTA 

TTCATTAGCAAC7TCTTCGGGAGATGCAACAGATGGCCAGCCGCCCCTTT 

GCTTCTGTAAACGTTGCCTTGGAAACAGATGAAGAACCTCCTGATCTCAT 

TGGGGGAAGTATAAAGGTGAGAAGTGGCTCAAAGGTCCATATAGCTTTTC 

AGAACTCAGGCCTCAGTTTGCTAGGCTACAGACAGCAAGCGCTCTGTGTG 

TCACTCCTGTCTCCTCTCTAACAGTTAGTCAGCAGAAGCAACCCCGAGCG 

ACCGTMGGGGCTCTGTGTGTCGCTTTACTTTTCGAGTTGTTGCATGTCA 

GATTTTMCATGCAAATTMGCTTGTTATTCTTACTTTGTGGCATAATAC 

FIG.3D(67) 

SUBSTITUTE SHEET (RULE 26) 
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TTTATAGTTTTTATTTGGIAAATATCTAATCTGGGCTAGGTGTGATGGTGC 
ACATCTTTMTCTCAGTTCAGAGGAGGCAGAGACAGAGGCAGGCAGGATC 
TCCTTGAGTTCTAGAACAGCTGGTCTACATAATGAGACCCTATATGTTAG 
AAAAAAAGAAAGAGGGGGTGGGGGAAGGCAGCTAACTTTAACCATTAATT 
GAACCMCACACACACATTTTGTTCAGAGCCCCAGTACTCAATTAAAAGC 
CAGGCAGGCATGGTAACAGTACTTAGGGAGTCAGAAACAGGATTCCCAGA 
GTAAGCAGTCTGACTAGGCTAGCAGGAAATGGTGAGTTTCAGGPTCAGCA 
AGAGGCCCTGCCTCAGTAAGTAAATTGAAGAACAACTGAGGGAGACTT'GC 
ATGTGCACTTGTGCATGCACCCACACATGCACTT-GCACACATACCATATG 

tcaccatgc™gactataaaatgtagtcactactggcagcacatggcta 
caatacagatgcaggagaatcactgcaaatttgagatcagcctgggctac 
tggacmgattttgtctcmgaamctaaaacaatacaaaagtctactgg 
gggggttattctmtgccagtgtttat<^cagcacattcagaact<gacag 
tamggcmtcaaggactgtcagtggtgggtatatacataggcagaggag 
caactgctactagaaactgtttatcctttaaaagactaatgtatgctgca 
gcatagacamcgttmgttgtgttaagtaaaagatgctgtatcattcca 

CTTACCCATCGAGMTMTCAAATACAAGACAGAGTAAAATAGTGACT"GC 

TAGAGGCTTAAMGAAAAGACCAGGGGGTG6GGW 

TGGGAGAGGGAGGMG^GAGAGGGAGGGAGGGAGCCAGACTTTGTGGC 

TTACAGCATCAAGAGGCTGAGXaCAGAAGGGTTACAAATTCAAGGCCCTAC 

T^CTACATAGTGAGAAGTAGGAmCCTOAGCTGTCmCTAGGTCA 

FIG.3D(68) 
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TAATCTCTCATTGGGGGAAGTCAGGGCAGGGACTTGAGGCAGAAACCATG 

GGGAATGCTATTTGCTGGCTCCTTCCCAGGCTCCTCTCTAGCTTTGTTTT 

CTCATTTTGTTTTTACTGTCTATGGGTGTTTTACCTGCTTGTTTTTCTGT 

GTACCATATACATGCCTGCTACCCACAGAGGCACTGATGCCTGGAACTGG 

AGTTACAGATGGTTGCAGGCTGCCCTGTGAGTGCTGGGAACTAAACTCGG 

GTCCTCTACATGAGCAAGTGTTCTTAACCATTGAGCCATCTCTCCAGCCT 

ATAAAATTC 1 1 I 1 1 I AAAAATAAAGTCTGCAACAGAAAATGAATA7TTTC 

TAGAGCTGAAGCATTCAATGAGTGGATAAAGAATCCATTTGATGAGCTAT 

CTACCTTTCACAAGCTCTTAACCCCTACAGACTCAGGACTTAGTGGCTGG 

AAGATGAATGTAAAACAGGTAGCTCTCTCCATAATATCTGGTCTGTTTGT 

GCCAGGTGTGCAGAACTGTGCAACAGGTCACCATACAAACCGGCGTGGGC 

CTTTCCTGAGACTCACACAGCTCTCGGGACAGTGCCCGTGGGGACCTCTT 

ATTGACCTTATAAGCACCTGACTGTGCAGTGTAGCAGGGAGTTAAGGTGC 

TTCTGTTTTCTTCCTCCAGACCGTTCCTAAGCCCATTGCCCTGGAGCCCT 

GCTTTGGTAACAAAGCCGCAGTCCTCTCTGTATTCGTGAGGCTCCCTCGA 

GGACTGGGAGGAATCCCTCCTCCTGGTCAGTCAGGTGAGTAGACAGGAGA 

CAATGACAGATATTGGTCTGTGAAGGACTGAGTCTTAGACACTTCTTCTG 

GTATAGAACCTGGGTCTGGGCACAGTGCTTAGTGGTACAGAGCTTTGGTG 

GAACAATTCTATAGTCCCCAAACTGTGTTCTGAGCACTGACATTCCTGTC 

CTGGGGTGGAAGTTCAGGACCTTCCTCACGGTGCACAGCGTCCTCAGACA 

TTCATGCTCTGGTCCCCTTGACTCTATTGATCCCTGCTTTC I 1 1 1 1 1 H I 

FIG.3D(69) 
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TTMCCCCTTGTTCTTATCTCAMTTTA^GCTTTTTCTTCCTTGATACAA 
GCTCCTATTCATCTCCATGCCTCT-GGCTTCCAGCCATGTCCTCAAAGCTT 
GTGTTGCCAAGTACAGAGTTCTAGTCATGCTCCACATCTTCTTAAGGTCT 
TGCTATGCAGCCTTAGCTGGACGAGTGCTCGTTATAGGCCAGGCAGTGGT 
GGTACACGCCTTATGTCCTAGCACTGAGGAGGCAGAGGCAGGCAGATC'TC 
TGAGTTCAAGACCACCCTGGTCTACAGAGTAAATTCCAGGACAACCAGAG 
CTACATAGGGAAACCCTGTCTCAAAAAAATAAAAACAACAACAGGAACAA 
CCCCAAAAACTCATTATATTGCCCAGGCTGGCTTCAAACTCATAGTTATC 
CTCCTACTTCAGCCTCCAAAGTGCTGGGATTATGGGTGTGACCCTTCATG 
CCCAGATTGTCTTAAATATGAGGCATGAAGAAGTATTATGAAAACATAAA 
GGATATTTTGAAMTTATMTTCTACTGGGTTAATGCAGATCCATTTTCA 
TTTCATTGAAATMTGATACAGCCTTTGGAGGTTAGGGGAGCCTCTCCT'G 
TTTTCAAACTGACTTTGAACTTCTGATCATCCCGCCACCACGGCCACCTC 
CTCCTCCTCCTCCTCCTCCCCAGTGCTGAGATACATCACTACTCCTGGTT 
TATGTGGCACAGAGGCTCAAACCCAGGGCCTCATGCATGCTAGGCAGACA 
CTCTACCAGCCAACCTACCCACAGCTCCTAGATGTGCACGGTATTACAAA 
CATTTATTCTTCAGCATG 1 1 1 1 1 I 1 1 I 1 1 M 1 1 CCTAAAAATC ATCTCTA 
CAGGAMCMGTACCAGT<^TG7TTTAGGGCAGGMTA«GAAGAAAATAT 
TTTTACTATATACTC 1 1 1 1 1 1 1 1 I MTCATTTTTTAGATTTTATTTATTT 
TAAMTTTATTTACTATTATTMTMGTACACTGTAGCTCTCTTCAGACA 
ACCCAGCAGA€GGCATCA^TCTCA™€<^T<3GTOT^G€<:AC<:^ 
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TAGTTGCTGGGATTT6AACTCAGGACCTTTGGAAGAGCAGTCAGTGCTCT 

TAACTGGTGAGCCATCTCTCCAGCCCCTACTATATACTCTTTTAAATGAC 

nAmGCTmATTTTTATGTGCATTGGTAATCTGCCTGCATGTATGTC 

TCTGAGAGAGGATCAGA7TCGTTGGAATTTGAGTTACCTTGTGGGTGCTG 

GGAATTGAACCCAGGTCCTCTGGAAGAACAGCCAGTGCTCATAACTGCTG 

AGCCGTCTCTGCAGCCCCTACTATATACTTTTTTTATAGTTTTGMTTTT 

TTTTTCTTTnGGGTATTGCTMGGATCAMTATAGATCTACTATTTATT 

T7TTATAACATCCATTAGTATTTTTATMCTTACTACATAGTTTGCCAAT 

TCTTTTATACATGTCCATCAAACATGTAAGTCATAATTTATATAAACCTT 

GTGTTAAAGCTGGAGGCACAGAAGGAAGATTGCTACAGAGTGAAGTCTAG 

ACTAGCCAGGGCTATATAGTGGGACCCTGTTGCAAAGAAAAAGTTCTCTC 

TTTAAACACAAAGGCAGTATGAAAAGACATACCTTGA7TCTGAAGCTGTG 

CATAGGAATGCCTCACACAGTG7TCTGCTCAGGACTATACTCAGATGCAG 

TGGTCTGAGGGACTTGGTGGTGTCTCAGCCAAAATAACCTGGAGTTTAGT 

AGGAMGTCTCCTTTATCCGTGTCCAGTCCTGAAGGGAAGCCTTATTTAT 

GTATGATGAGTCAGGACCCATTGTCTTCATCTTACTTGGCATCCCCCCAG 

CACTGAGTCTCTGAGTTAGCCTTACTTGGACAGAGTGACTCTCTGGGCAC 

TCTGGACAGCATCTCCTGCTTCAAAAGGGCAAGATCTTTAGAAGACACAG 

AGATGGAGCAGGTCTTACATGGAGATATAGCAGCTTTTCCTTCCTGACCC 

TTGACCCMTGCTTCTTTGGAAATCCTCATGAAACCCTGCTCCTTTCTGG 

AGACCCACCCCACAGCAGGGTTATCCATGCCAAGCTTCCTGTACTTTCTC 

FIG.3D(71) 

SUBSTITUTE SHEET (RULE 26) 
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TTTTTGAGGAAGCACATACACACAAAGTTTTAGTAGCTCGCACATCTCAC 
TGTGMGTAGTGATACTnCATTGCTATCTTCTGGAAACAGGCAGGAGTA 
GGCACACGCTCAGAGCATAGCTGCACTCTCATTCACTTGCCACCCTGAGG 
CAGAGCACACGACTTTGTGATCTGCTATGGAGGAGAGAGAAATGAGTAGT 
TAGGTGTGTATAAATAAGCTAACACCATCACCCCTTTATCTTTCACTAGG 
GAMTGTAAAMGAMTCTGAMTTATTTTKaTAAAAAAGTAAGCTGCTTC 
ATGACACATGTCCCCTCTTGTGGGTTCTTCCAAGGTCTGGCTGTGGCCAG 
TGCCCTGGTGGACATTTCTCAGCAGATGCCMTAGTGTACAAGGAGAAGT 
CAGGAGCTGTAAGAAACCGGAAGCAGCAGCGGCCTGCACAGGCTGGAACC 
TGCATTTGATACTGGGGCAGGAATTCGCCCTCACAGAGGGeGTGTGGTCC 
ACGAAGCTGTCTACAGGGGAGGCTGCAGGCAGGAAGCAGGGGTGGGGCAG 
AAGACTGGGGACCCTTGAAGGGTCCAACTCATGTGCATGATCATGCAAGC 
TGTTTTCATGGCTCACC<XTCTGTGTCCAGCATCTMCCTTTTACTTCTG 
TGTAGGAMTMTTrAATTACAAGTCCAGGAATGGTCTGCTCTACTCATG 
GGTGGAGGAGACCAGTGCCGACCCCGTGAGAGCTGAAGGTGATGCTGAGG 
TCCCTTGTGGAAGCCTCTCTTGGGAATCTCAACTGCAGAGGAGCTGCCCT 
CTGTCAGCAGCTCTCCAGCATGGTCCTCTGACACTGCTCAGATGAACTGT 
TCTCATCGGAAGCTTGCTGTCTTTTTACAAGATGAGCTTTTACTCTCTTC 
CAGGMGTAGCTTTTmCTAGCTGAGMTTMTMTCGTCmCTCTTT 
GGAAGTCATATCAMGTATMTTGATGG<BGGCCTT<^^ 
^TTTTf 1 GGAGACAGGGTCTCACTGTGTAGTCCT^^TGGGCTGGAACTC 
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ACTATGTAGATCAGGCTGGACTGAACTCACAAAGATCCACCTGCCTCTGC 

CTCACAAATGCTGGGATAAAAAGCATGAACCACCAGGCCCAGCAAAGAGG 

GCTATTCTAAATGTCAAGGTCAATGGAGTTAGAATATATATAAAAAAATG 

CMTTGATMTTCTCTATAGAMCTTGATTMTTTTAATCCATTCTTTCC 

TTCTCTTTCTCTCACTCTGTCTTACACACATGCACACATACACACACACT 

AAGTGCCTAGACTTTGAATAGATCTAGCAATTGGACATTAGTAAGCCTAA 

GTTTTTACATGATTGCATTCCTACATTCTTGTAAACTTTAAGTAACTACC 

ATTGCAGTTTGTTCTTTTTTTAMGTCTMTTTGCAGCCAAGAACGAGTA 

ATTCTCACCCCAAGCAACATCTAATAGGGACTGAGTGACCCCAGCCCAGC 

CTAGTGTCACTTTAGGCCTGACGTTTGAGCAACCCTCGGCTCTTGCCAAG 

GCACCACAGAATGCACTTGCTCATGCCCTGTGCCTCTTGAGCAGAAAAGA 

GCACTGACAACTGGGACACCTGGCTCTGTCTTCCTACAGCTGCTCGCACT 

GACCTGTGGGAACCTGTGGGTCATCCCCAGGCTGAATGGAGTACACACTA 

GAAGAGGGATGATGCCTAGCATTGGGGCAGCATCTGCTCAGCACATGGAA 

AGGGACCTGGTTCCATCTCCCCTGGGCAGGAGTTGGTCCAGCCTCCTCCC 

AGACCCAGCTGGTGGCTGTGAGGAGGTGGGGAATGCTAATGAGAATGAAA 

AGCACATGGGTTGATGGGAAGGGACAAGATTACCACGTTAGGAGGGTGAG 

CAGCCCTCTGCTATGTGCCCAGGACCCTGCCTGGACATTGCATTTCCCCA 

TTTATGGTGCTCCGTATTCTGGCATTATGGAGCAGCCTCACACAGCTGTC 

CTCTCCTTCTTCATGTCCTACAGTTCTGCTATCACCTGACTAGAATAGCC 

CTCTAGGCMCAGTGCTCAMTGTATGAGTTTGGAGAAGTTAACAATCAG 

FIG.3D(73) 
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AAGAACAAAAACTGTAGTGTTTCACCTTTAAATGCAGTGTTGAAGA6GGA 
GCCTTTCTCTAAGCCCTGCACTAACCCACTCCTGCCAAGACTCTTGTGGA 
6TGACAG7TCCAAGCTGAACCATAAATCACTGATGCACAAAACACTGCTA 
GMGGCTCACCTCTCAAAACAC^CTCTTTGCATCACTATTAAAGAGCAG 
AMGTTCTAGAAATGATCCCAGCCTCATCCCCTATACAGTTAGGAGCTCC 
CCACATCTCTACCAAAACCCAGCACATAAGTATCTtaCGTGGTOTAGCCTT 
TCATCTCCGTMCMGCCAGGGGACTCTTGGCCAAAAGAAAGAAAGGGAA 
GTTGCACTAGGGCTTGTCCGTCCATMGGAATTCCGCTCTCCTTTGCTCA 
MGGACCAMTTTCTTTGGCCAAAGAAGTTGCTTCTATGTTAGTCCCATA 
CCCTGAAGTAATATGTACCATGGCTCCCACCTACCT^TTTATGCTCTCCC 
TGCCCCCAGGGAMCTGTTTATTCTTTCAAAAGAAGCAAACAGGGTTCAT 
7TCTGCTCCTGTAATGGAGAAACAGCCAGCTCCCCTGCATCCCTTACAGC 
CMCAGCTCCCTTCAGGCTTAGAGCAGG^3GGMTGGCAGGGATTAAGAGC 
TGAGCTCAGAGCCAGTTACCAAGATCGAATGGAGTTGT'GACCCAGTAACT 
GTGTCACGAGAGACCATGTATATAAAATAGTCATGACGACACTGAGCTCT 
TGCACTTGTACATAACTATACTGTAGTGTCCAGAATGTTCAGACATTCAG 
GGTGTACATAAACAGAAGAGTATCATAATGTATTTTTATTAAACACTAAC 
ATCTGAGTTTCACCTAATCTGTTTCTGTGCCATATACTGGGTATGCAAGC 
TCTGGGAAGTTATCCTACCAGGCCCTCATCTGTTGATAAGGCACTATACA 
CCATGCTGGTGTCTTCTCTAGGCTTCT<KCCA1TAGGTMCTGAAGAATG 
ATT£AGCTCTTAGAATACGTAGGAAGACA<£AA^ 
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TGTGATCTAAGCATTCA6AAGACAGAGGCAGGAAGAAAATTCAAAAATGG 

GGCTGGAGAGATGGCTCAGTGGTTAAAAGCACTGGCTGCTCTTGGTCAGG 

ACACTAGTTCAGTTCCCAGTACCCACATGGTGGCTCACAACCTTCTGTGA 

CTACAGTTCCAGATAACCTGACACCCTCCTCTGGCCTCCTCGGGTGCCTG 

TGGTGGTCCACCTGGTGCACAGACAAACACCCAATACACACAAAACAAAA 

GTAACTCAAGAATAGCCTGGGCTACATAGCAAGAGCCTGTCTCAAAACAA 

ACGAACCTATGAAGAGCCAGGCAGTCTATCTATTTACATGGCAGTATACT 

AGAGAAACTCAGGAAGCAAGAGTGTTCATCACTGTTGTAATTTCAAATGC 

TCCTTGTGATTTCTGGCATCTCTGTGGGGTGAGGTGTTCTGTTACTCTTC 

ACATTCAAAGACTGTCACCCATGAACGTCAGACTTTGCAAAGGGGCTCTC 

TMGCTGCACTGTTGTGGCTTTGTCTAAAATTTTAATGACGTTTCTGAGA 

ACCATGTTCTTTTTATACTAAAATCTGGGGATGGGAGGGCTCATTTGTTG 

ATAAATAGCACTATTTTCCCACACCTCAGCCTCCTGTCCCCGTCCTGGTC 

TTCCCTACACAGTCTGGAGAGGGCTCTGAAAGGTCCACAGAGTTTGACAG 

ACACGAAAGCAACCCATTGCCCCGTTGACCTGACCTGGAAGAAGACTGTC 

AGCAAAAGGAAAATACCAGAATATCTGGAAAGCTTGAAGTGTAAGATGGG 

ATCTCGTTGGGGAATTGGATGAAGAAAAGCAGAGCGCCTCTGGTAGGTGA 

CTCTGCAGCCTGCCAGCGCCCGCCCTCTTTCTACACAGCAGAGTGTGCAT 

GGCAAGGAAATGAGTCACCTCCTTGGGGGATGGTGCTGTTTTT ATGAAAA 

CCTCTGATC CTTGGTGTC CTTTAATTGATCTGTTC AACAAATATTTACTA 

AACACTTCTAAGCTAACATTAGGGCAGTGACTGAGGTGGAAACCCAGCTC 
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TTTAGACA6CTGTCATCCTAGGATAGCTTCCTG6AAGCAGAACCAAGAAG 

CCAGAAGGTTCTTCCTA6GGT6GCCTTGGCTCCCT-GAAGGAATCTGAAAT 

GCTGACCCTGTCACAACCTCCCAGCACAGCTTTGGAATGAGACATCAGCC 

TGGCCTCCAGCAGAGCAGAGGCTCTGGAGCTCCACATCCTGCCTGCAGGG 

AGCCCTCAGGGTGCCCTCCAGAGTACAGGGAGAAACTAAAGGCAATAACA 

GAAGCTGCTCTCAGAGCCTGACTGTGCACAAAACACTAGTGAAGCCTGCT 

GMCTMTTCTGCCTCTGGAMTCTTTTCTGGTTCTTTACAGTTTGTTGT 

TTTGTTTTGATCCMGCTTAGTTTGTTACTATGTGTGATTTAGCATCTCT 

CGCACTTGTGTAAATATGGAGTAAGTATTGTAAACTATTTAATTGCTGCG 

ATTGTTGGGTTATACATACATTTAGGACTGCAA 1 1 1 1 1 \ GGTA1 1 1 1 1 1<2 

TATTGTAAAATAACAGCTAATTTCATCAGGAACAAGAGAATTAAGGGGGT 

CTGCATTTTAMTGCAGATGTGAAGCACTT-GTATATAAATAAAAGTAAAT 

ACTATAATACAAAGTTCCTTCTGAAATAAAAGTAGATCTGGTAAAAATGT 

GCGTGCGmCGnCTCMTGmMTCCTAATmGTTTTATTTTATAT 

TTACATTTTAGTCCTTATTTTAGCAGTCAG^GACAGGCACAGCAGTGCA 

TTCTCACCTTGGCAGCTGAGGMTCCCCTAGAGTAGACTGCAACTCMGA 

CTCTTGGCTTCCACACTGMAAGAGTTTCAGTTTATGAAGCAGAGTiTAG 

GMGTTTAGTGAGGAATTTMGGAC7TCT7TTMT<^^ 

TGTGGGTACATATATGACACAGCATCCATCT€«AAGGCAMCAACACCTT 

MTGGMGTG^T<^GMC/WVCTCAGGACTTCMTCTT^GCAGCATA 

MCCTTTACCTMT^GTCATCTCCAGTCTATAC^GTCTGTGTCT^ 

FIG.3D(76) 

SUBSTITUTE SHEET -(RULE 26) 

8NSDOOD: <WO 0005373A3 1A> 



WO 00/05373 



PCT/US99/16484 



112/173 

CACATGTGCAACAGCACAGAGTG6A6GTCAGCACAACTCTTGCGAGTCAA 

TTCTCCCTTACCTTGTAAGACCTAGAATTCCACATTGCCCAGGCTCTGAA 

AGTTAGGTTGGGTCCACACTGGGCCATGGCTGATGAAATGTTGGAAAAGT 

GATMCACCAMCTTTTGCACAGAAAATATTTTCATCTGGGGCCTTCCCT 

GGAGTTCACAGGCTAAAGTGTTGGAAGGAACATGGGTCCCTGAGCCACCA 

CTTTCACAAAAACTACCTGATCAAGAAGAACTATTCTGGGTTTCTGTTGC 

TAAAATTCCTTCCCAGAGAGAAATGTAAGCAATGTCTGCCCCTTCAAGGG 

TCCGAGCAAGAAACCAAGGCACAATTCCACCAAAGTTCACTAGAAAACCA 

GTGAGTTTATTGGGCTTCCGTGCAGAACATACATGAGGGGTTACTTAGAG 

AAGTGTGGATACTCCTCCCCCTAACAATCCACACCCTGAAAAAGCCTTAC 

CCAGCAGGGATGAGGGCTTCCCCAGACCCACATTGATGGTGCTCCCATTC 

CAi N I l CCCTGGCATGCAAAGAGATAGACAGAAAAATAGATTATATATA 

ATATACACATAAATTAGAAAAATAGATTATATAATATACACATAAATTAT 

ATATTATATATATAATATATAATACACAGATAGATTATATATGATATATA 

AAACACACAGAAATAGGGTATATATAATATATAATACACAAACTACTCAG 

CTATTAAAAACAGTGGATTCATGAAATTCTTAGGCAAATGGATGGAACTA 

GAAAATATTCTGAGTGAGGTAACCCAATCACAAAAGAACACACATGGTAT 

GCACTCACTGATAAGTGGATATTAGCCCAGAAGCTTGGAATACCCAAGAT 

ACCATTCACA6ACCACATGAAGCTCAAGAAAGGAAGATCAACGTCTGGGT 

GCTTCTGTTCTTCTTAGAGGAACACCCTCATAAAGTAGTGGTGGGGGGTG 

GGGGGAGACAGAATAGGTGGTTTCCAGGAGAGGAGGAAAACAGGAAAGGG 
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aataaataacattt6aaatgtaaataaagaaaatacccaataataaaaga 

aaaagaatttt6aaaca6agggtaaaaaataatacacaaaccaggtagat 

agattatatataatatatataacacagagatagatagatagatagataga 

tagatagatagatagatagatagatagatagatagataggtcaact^ctc 

gcccctccactaggtaacatgcagttaaggcagagctgcatacaagagat 

gttagggatactcaggtgagaatctca<^tttgctccatccatctatgc 

tggggtgtmgctgtcmcmgtttagctcggatgatgctttgcaagagg 

gcacagctgaatgccctaagatggtagatgcttggctcaaaggagacact 

acagctctgcatcmggcamctmctgagatgaggggctttattttcca 

gatctgtatcctggagcatcattcacctgttactacactgaaaacatttg 

gtgttggtttcatg^agat(^cag<3cagtgagagaagtacagcagcgga 

ctgctagaggtgggggttctctca^gacgtg^ga€gctgtttggttagta 

acttggamgcmcmgtttttagctagagggagaaaagctggagataac 

tgtacttgctt^tttcttaaatatcaaattttattttatgcatatgggt 

attttgcttgcatgtatkagctatacactacat^ptct^tgcccagaga 

gaccagaggaagtagtgtogcctct^^ 

acttgagtg€ctgamctgmg€tt®3tcctctcg^gmcagcgagggc 

tcctaaccactgagctatctctcca«:cct^cagaacatcatgtactgc 

AGGCTGGTCTCAAAm^mATAGCCMGAAG^TCTTAAATTCTGAT 

CCTCCTGTTCTCTCMGTA6T<3GG<a^ 

TC^AAATCATTACAAATT^GTOTAA^^ 
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AGTAACAATCTGGAATTTTGGTCTCTTAAAAAAACAAATATTATAAGAAT 
GTATTTTCAT7TTAATCCCAGGTGTATGGCATATATCGAACTGCTTTGGA 
CTGACTACAGCAGCTATGATTTTTTCTTGTTCTAGCAGAGGTATGGTTTT 

gccagctacagatagtttctgtgattgtgtgacatttggaattctggaaa 
cttttcagatggtatataaata™gagccccaataggcagagttgatga 

TTGTTGGTCATTCAGGGGTATTGGTTGTGGTTAGTAGTCTTGCTTGAAGA 

AGAAACAAGAACAAATOGATTCAGAGATCTCTATATCTCTCTCTATCTT 

CCTTTCTGTCCTATCTAGTAATAGGGGGTAAAACCAGGATGATAAAGGGT 

TGGGGGAACCCACAAAGTAACAAAGACTGGCTACAAGTGGCACCCAACTT 

GGMCTCAAAATTGCCATAGAGGAAGCAGCAGGGGATGAAGGAATGGATT 

GTGGCTGTTGTTGCTGGGATATTCCTCACTTTGCTCCCAGAGGGGATTTT 

TCTGAGGTTTTGTTGTTTTGCTTTGCTTTGCTTTGGTTTTCTTCTACATA 

TTCTGi 1 1 I I I AAGTAAGTTGAAATAATAGCCGAGAAGCTGGAAAAGTTT 

GGTGTGGAAATGGAGCAGCCTGAGAAACAAACAATGTATGAAATGGGAAA 

ACTAMGGGGCCACTCTTCTCTCTTTTCTGAAAGGCTTGCAGACTTGGTG 

GTGCACCTGGAGAG7TTATGGATGGAGATGGAAGCTCTTAGGAGACAAGA 

AGCATGGAAAAAGAGAACAAAGGCTCAGTCCCAGTGACTGAAGAGAGCAG 

GAGTTTTCCAAAGAAGGTGCATGGGAGGGCCACTGGTCAGAAAAAAAAGG 

CTGAAAAATACCAAAGGACAATGTGCTGAAATAGCCCATTTCAAGAGAAA 

GG^TTCATCTCAMCCAGCATTCTGACAGAGTGGAAGGAGGGGTGGCTCA 

^TOTGAGATCACCATCAGCTmCCAGTTTTCCCATATAGCATATCC 
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CTGCTAATGGTATGGAACAAGAGTAAGGCAAAATAGGATCGTCTCCTATA 
GAMTGATAGCTCTMGGTGTTTTTAAAAGGCCTTGATTTCATATGGAAT 
GCACTCTCCTTATGTGGAACGGATATTAAATAACCiaGGGTACACAAACTA 
GMTCCCTTCCCAAGATTGGAAGGGATTGGTAACAGCTGTACTAGAGACT 
GTCAGCCGTTGCAATGGTTAACATGCTGGAGGAAAGAAGCTGTGAACATT 
GAACAGTAAAACAGAGCAAGGGGTATTAATATAGTGAACGAACAGCTGCT 
AGGTGAAGGGCGGTACTCTAGTGTACAAGCACAGACTCGGTGTCATGAAA 
CTACTATAGAACAAGGTTGCCTCAGTGGCTATAACACCTTGGGACAAAGG 
AGGAGCCAGGAAAAAGTCCAGTTCATTTACAAAGATTATATAAGGCTCTG 
GAGAAGCCTTCACTGA I 1 1 1 1 1 1 I 1 1 I AC AAAGATTAGTCTC AGCTATGA 
ACAAAGCCATATCAGACCCTGACACMGGCAGGTGTTGATAGAGACCTTG 
GTGTATGACAATGCAAATACCAAATATAAAAATGTCATTAGACTTTTAAA 
GGCACAAGTAATGCCTATGGATGAGTGGATAAGGGATAAGACCAATATTA 
GTTCTAATGTGTACTGTGCTAATATCATTGATCAAGCTATAGCTAGAGAT 
CTCTGATGTCAAMTCCCTTGTGCTTCAGTTGCAGCAMTACAGTAATTT 
GCMGGAGTCATTGTGGCCAAAACTAAAGGTCTCAGATCTCAAAATGGCT 
GATGCTTTGTGGGAMTAGGGTCATTTGCAACAAAAATGT-GAACAAGACA 
TCTTTMGGGCMTCGTTTTTCTAMTATAMCCAGAAAGA€GGCCTAGG 
CTTCCMGGTTGT^T^C^T<aT<^CAG^TT^CACTGGACCMTGA 
GTGTAGGTCCAAMGAGATATTCMGGTAAG^TATTACCATCATGAAATC 
GTCTTGGGG<KCTATCTT<^G6tt^ 
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AACCAGAGAGTGGCATGGAGACTCAAAACCTTCACTGGGCACTG6AGATT 

TAATGCACACTAGCTATTGCAGGCAGCATGGCTCTAGACTTGGCCACAGA 

TAAACATCTTGCTCTATCCCCCAAAATTCAAAGTTATAACATAGCTACTG 

GAGTGTATGGTCTTTTTCCCTCAGGGACAGTAAGGATAATCTTGGGAAGG 

AGTGGATTGACTTCCTAAGAATTCACTGTGCATCAGGAAGTATAGATGAA 

TATTTCAAAGGAGAAATTAAAATTGTGGCATATGTAAAGGTAGAGCTGCA 

ACTTAACACAGGCGATAGGGTTGCTCAGCTGCTGCTGTTTCCCTATATCA 

AAGGCAAAGCAAGTGCAGCAGAAAGAGGAGAGGCCTGAAAACCTTGGGCA 

CTGACACAAAMTTGCTTATTTCATTGAAAATGTCTGTTTATAACTTCCC 

ACTATACAGCACAACAGGAGGGGGCTTAAAACATAATGGGGAAAATGTCA 

CMTTCTGCMTTTTTGTTTCCTTAAAAAAMCACACACACAGMTTTTA 

ATAATGTGTTCTCATCTTAATCCCGGGTGTGGGAATTAGGGCTGCTTTGG 

ACCATTCCCAGCAGCTGACTATGATTTGCCTCATGCTCTAGCAGAAGTAT 

GATTTTTGCCACCTGCAGATAGTTTCTGGGATTGTGTGACATTTGGAATT 

TTGGGAACTTTTCTGAAGGTATATAAATGCTAAGGCCCTGGTGGGGAGGG 

TTGGTGGTTGGTGGTCATTCAGGGGGGTGGTTGTGGTTAGTGGTCTTGCT 

CAAAGMCAMCMGAMGTCATTTGATTCAGATGTATCTTTCTTCCTTC 

CCCCACTCTTTCTCTCCTCCCCCCGGCACCCTGCCCCCTGCCCCGACCTC 

TACCCTTCTTTTTCTATCTAGTGACAAGGATGAAACCAGGGGGATAAAGG 

GTGGGAAAAAGAAGAGCCCACAAAGTAACTCAGGTTGGCTACAAGTTCAT 

GCCMGMTCCTAGGACCTTGTTGTTTAAAGGCTTGTlTrATTTTGTGAA 
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CATGAATGTTAAATGTACATACATGTTAAGTGTATGTATGTACACCATAT 

GCATGCATACAGAATCCAGAAGAAAGTACATTATACCCTGGAATGGAACT 

TAGAGTTGTGAGACAGCATGAGGATGCTGGGAACTGAACCCAGTTTCTCC 

ACMGAGGAGTAGTTGCTCTTCACTCCTTAACCTTTCCTCCAGCCCCAAT 

CCTAGCATTTTGGAGGCTGATGTAGGAAGATTATCCCAAGTGTGAGGTCA 

TCTTGGGCTCCATMTMGTTTMGACCMTCTCAGCTCCAGAGTAGGAC 

CCTGCCTCAAAAACACACAGGTGGAAAGATGGGTGGGCAATGAAGAGCAC 

ACACTGTGCCTCCAGGGGACCCMGCTTGGGTCCAAGCACCCTTGTTGGG 

CAGCTCACAACTGCCTGTAACTCCACCTCCAGAGGATCCTAAGCCACCTT 

CTGGCTTGGCTTCATGGAGGGAACAGGTATGTGGGTATCT-GAGTGTGACG 

AATGAGCAGCAAGTGAGTCraCTGTGGCTAGCACAAAGTATGGGCTGAA 

GAGCAGGAGGACAGCTGAAAAGTGGCCmCCTCGTGACTAAGTTCGTCT 

GAGCAGCTGAGTCAGTrrCTTCCT^CTCCTTGGCTCGTCTCAGT^TTA 

TMGCTGCTCACTTGTMGTCTTTTCCTA€GAGCCCAGC7T"GTCTAG8GG 

TTGTCmGCMCT<^CCTTGTCTGACAGT-GACTTTCAGCA'GTC7TAGCT 

GCTTATATACACAGTCTTAGGAAAGAAGGCTX^TCAATCT^TCCATTTC 

AGGMCTTTCTGMGCTATTCTGMTTTACTTTACAAGCTTACCTGCAGG 

ATAGAGGATCTCAGCTCTTTATAAACATCCTGTCCTAAAAGACCCTGTTG 

TOCTCmTCTmACATCCTGTGTCTOAGAAGmGCCTCCAGGATG 

GMGTTGTTCMTTCAGAt^CACTGTO^ 

TGT<aAGCTCAGTGCTCTCCTT4^^ 
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ATTTTGTCATCATMTCTTTTCCTATATCCTTCCTTGTTCTGGGAACTCA 

TCTGGTTCATTTTTTTGGCATTTTGAGAAAAGCTCTCACTATACAAATCA 

GGCTGCCTCCAAATCATCTTTTTGCCTTAACCTCCTCAGTACCAAGATCA 

CGAGTGGATCTTAACACTTGACTGACTCGTTTAAGTGTGAGGAAATGTGG 

ACCAATAAGAGAGCCCAGGAAAGCCCAGGAGAATCTGTAGCCCCATGGCT 

GTTGTGTCAGMCCCAGAGTTTTGTCMCA6MTTTGGTTCCTAATTTCT 

CCACTTTATAAAAACGAGTGAGAGAAACAGGAACCTATTCAGATCTGGCG 

TCTGAGCAATCAGTGGGTGAACATCTAGAGATCTGTTCTGCATCTCCTCG 

CCAGCTGGCAGAGCATGCGTAAGGCGGGAGGGAACAAGGGCAATCACTCA 

CTCTGGGGCTCAGGCTTGCCCCTTGGGTCAGGTGTTTCTGAGAGACGTGA 

TGTCTGCTTCTCTTGTTACCATCCCTCATCCTCTCCCCTCCTTCTGTCCC 

CTACTTACCAATTTCACTGGCCAGTGTCCATATTTCCTGCAAAAGCGATT 

TGGTTTAATGAGCTTGACTATGCCCGACTCCTTTAGGGAGGGTGGGGAAA 

GGGCAACGAGGGCAGTAAGTGGTTTCCACAACCACTTTGCACCCGGCTGC 

TGGGCCCCAAGCCAGAGGAACGTGCATGAGCCATGAAGTTTCCACTGATA 

AATCCACAGATGCTTCTAGCACCTGCCTTTCTGACTCAGCCTCACCGTGC 

CGCCTGCCAGCTGTGAAATCAGTGCCAACAACAGGTAACCGAGACCCAGG 

CGCAGGGCCAGGACAGCTGTCTGACACTTCCAGACAGGATGTGGAGGCTG 

ACAGTTGTGATGGAGAGGAGATGGGGAGGACAGAGACGGGCTCAGCTTTA 

AGACACCGAGCCACAGAGCACCAAACAAAAGCCAGGGCCTTCTGAGGTAG 

MGTMCAGAAACCAMCAGGCMTTCTACTAGTTTCCT<3GGACT€TTTG 
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CTGCATTTGCCMTCTTGGTAGTTTTAAAAAACAAAAACAGTTT'GTTCTC 
AGCACTGGCAGAGCTTTCCTCCTCTGGAGGCTCCAGGGGTCCAGACTCTC 
CTCTGTGGTACACTGGCTTCAGACATATCTCTTGCCTATGGCTGCCTCAC 
TCTAAACTCTGCCTGTCCTTGAATTACCTCTCTCTGCACTGGCTTTATAA 
AGGAAACATGAGATTGTGTTTAGGGCCTGTTTGGGTGAGCTCCTCAGGAT 
CTATAACATAATCACATCTCTACCGTATGAAGTGACGCTTCCGTCCCAGT 
GTGTAATACATTTGCCGGCGCCTGTCCTTAGGACAGTGACCACCACCAAC 
TGTGGAACTTGACTATGTCCACGTCATCTTCCTACTAGCTTTAGAAGGCT 
TATACCCACACTTTCTATCCAGMTTGTATTTTTATTTAGAATCATTCCT 
ACTTTTAAAAMGTCTCTGTGGTTAAMGCATT<BCAGAGGGCTT<5GGTTT 
TGGTCCCCAGGACCCACATCAAGTGGCTCACAGTGTCCTCGAACTCTTGT 
TCCAATACCCTCTTCTGGTCTCCATAGGCACTACATACATATGGCACATA 
TATGTATACTCAGGCACAC^TGTAMTmMTGTCTACTTTTTATGCTA 
AATATCAAAGTCACTGGAGCAGTGGAGTFGAGCACACTCACATAAGGAAA 
TCATCAGACAGACACTTCATCCTGTGTreGAGCCACTTTGTGGCTGGAGT 
MGCAGGGCAGAGTGATGTTTTCATTACTCTCTGGCCCCAGCACCeCCTG 
CCTCTCCCCACCCATTCGTCCATCCAGGTGGGGAAGAGAATTCTCTTTGT 
GAMTTGGAAGTTTGGACCCAGCTTCACTCTTACTCTGCCCAGTACCTCC 
TGTGAGAMCCCTCCTATCCCAGGTGACCTGCTGGCfGTGACTCTCCTCA 
GCAAAAGGCeCGTGACCCACACTGeGCCACTAATGTATCATCCGCAAATG 
CTGAAAAGGAAGCGTGTCTTCCTCTCT^ 
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T6AGACAGAGTTTCTCTGTATAGCCCTGGCTGTCCTG6AACTCACTTTGT 

AGACCAGGCTGGCCTCGAACTCAGAACTCCGCCTGCCTCTGCCTCCCGAG 

TGCTGGGATTAAAGGCGTGCATCACCACTGCCCGGCTGCGTGTCTTTCTC 

TTAGCGGTCTCTGTGGAGATGCTGAGTATGAAGCTCATCCTACCCACCCT 

TCAGTGGGGCCTTTTCTAGCTACTGAGCAGCTGTGTGAGGACTCGTGATC 

ACAAGGTCCTTTGAACCCTTGAGACAGATGTGCCTGAGCCCAGTTTGACC 

TGACAAAAGCCTAGAGCTCACTGATAATGCCAGCAAACACCATCTTTGAG 

TTTGCAAAGGAATCGCAACACATGCATTCAGTTTCCGTTGCTGGCTGCTG 

CTCCAGAGATGGCTATATTCATTCTCAGGTACTCAGACTCAAGAGTAGTT 

CTGGCCACACAGGTCTCCACATTTCGAGGTCAAATGACAGAAAACCAGGT 

TGGTCTCAGTGCACATGGGTTTATTGAGCCACTGCAGGTGCTGGGGAAAC 

CATGGCAGGGAGATCCTGGGAAGCCAGTGGGGTGCTGAGCAGGAGGGACC 

TCAGTCTCTCCTTAATGTCTACACACTGTGTCATAGGTGACAAGCCACGT 

CAGTGCTGTGACACGGGTAAGCTTAATGGTGAGTAATGGCTAACTGGGAG 

GGTATTTAGGCAGCCTTGTCTGTC AGCCTGTTCATATGATCTCCTTAGTG 

CCTTGTCATCTTGGAAAAGGACAGTTCCAAATTCTAGGAGCGGGGGCTAG 

TCTCTGTCCTGCTCTGTAAGCCCAGGGGACCCAATGAGGCCTCATCTATG 

GGTGCTCAGCTCTAGGATGGGGAAGAAAATGGACAAGATGCCTACTGACG 

GGMCACAGGCTTTTCAGTCAGACCCTAGCCTCCAGCCCCCAATCCAGAG 

GACAGCCACACAGGGGTCCAGGCCTGCAAAGGGCAGCAGACCTGAGGGCA 

AGGGAGTTTCAGCTCAGTGAGCAGTCATCGGGAGACATGGCAGTCAGCTG 
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TGTCGTCCACGGTTCATGTTCCTAATCAGAGCAGGGCCT66AGA6CCAGG 
GCAGTGAGTGCATACAGCCAGGACACCTTGtBGGGTTAGGACAAAACAAGG 
ACTGTTTCTGCCTCCAGCTCTTCTCAGGCCACTCGTGCCTT<3CCTAGGAA 
GGGTAAGAGAGCACAGATGGGAAGGATTGGGAAACTGTCAACTCCCTGTC 
CTCTCCCCATACCTACCC-GCGGGAAACAGCAC€CAGCAt3TCTGGTCCTGC 
AGAACTGATGGCTGCAAGCTGTCAAAGGCTTGTATGGCAGCATCTGCGGA 
GTGCAGAGATCCAGAGAAGGCTTGGCCAGGAAAGCCTAGAAACTACCCCA 
CTCCCTTGGGACAAAAAATAAGACACCCTGGAACCTGCAAGGCATGGGCT 
GAGATGGAAGGTCACTGTGCTAAGAATGACGCACAAACTGCTAGTGAGGT 
TGACMGGGCTGCCCCXTCTCCCTTTACAGGTKaAACACAATGGGGATTAA 
TAAGAGTTTAACTCTCAGCTACTAAGTGGCAGAGACAGGCTTCAAACAGA 
CCCCCAGAMTCTGGMCTGAGCCATTCCACCCAGAGGCAAGAACAGCAG 
AGGTAAGTTGGGCACACATGGAAGAAAGGGCGACCCCATTAGTGTCAAAA 
GGGAGGCCMCTTCAGGCCATTGGACAC<3TTTrMGGCT€ACTTCCAGCC 
ATGTACCATGGCATGTGCACACTGTCCATCGCCCACACCAAACATGAT<5C 
GACGTAAATMGACCCACGG^CCAGGCAGCTT<3GATT<^CCACAGACAT 
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1141m | EXONS 1-12 | EXONS 13-18" 



EXONS 19-28 I EXONS 3' UTR 



SIGNAL P? EGF EGF/METAL 
TM CUB 
TM 



EGF EGF 



FIG.4 



Exon 1 CelegE106 TCTCCTAGTTGTAGTACATGCTGTTG 

CelegE108 AGGTCCTGTCTCAAGAAATAGCAATAAC 

Exon2 CelegE33 TTTGAAGGCCCCTGAAGTCAGAG 

CelegE36 T TGAG TCCCCATCATAAACAT AT AAATGG 

CelegE37 TTCTAGGCCAAATAGAATAATGAGACTTC 

CelegE40 AGAACTAATTCCATGAGATGAGTGTG 

Exon 3 CelegE41 TGAAGTTGCTGTAATCTGGTCTGTG 

CelegE44 AAGGAGCCTGACTAGAAGCCTC 

Exon4 CelegE69 TAAACTCCCTACAGTTCACTAACTCAG 

CelegE72 AGCGCTGTTGAGTGTGAATGTTCTG 

Ce I egE73 AAAGCCACAGTTGTCTGTACAGTGAG 

CelegE76 AGGTCTGCATTAGTTGCAATGTTGC 

Exon5 CelegE77 TATACACCCCCTTATATACACTCAG 

CelegE80 AGAGCCTCTCATAAAGCTGTGGTC 

Ce I egE8 1 TTGAACATATATCCGCCAACAACCC 

CelegE84 CTTGGAATACTATAAACTTTCAGGCTGC 

Exon6 Ce I egE 1 01 TAAAGCAACAGGAAGAG T TGAACT TCTTG 

CelegE104 TGCACCCTGTGTGCACATGG 

Exon7 CelegE109 TTACGGTGTCCTAATAATAAGGGCAG 

Ce I egE 1 1 1 AATCATGGGTATTCTTAACTCCGAAAGC 

Ce I egE 1 1 4 TGTAACAATGTGTGCGGAGTGTCC 

Ce I egE 1 16 TCTCTCTCCAGCCCTAGAGTTG 

Exon8 CelegE86 AGAAGAGGAGCCTGCAACATTGAC 

CelegE88 TTTGTTGGCGCTGAAAGCCTTG 

CelegE89 TCGCCACAGTAGTGTTTATGATGAC 

Ce I egE91 TTAATCAATTGCCTCTGCAGATTCTAG 
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Exon9 


CelegE93 
CelegE95 


TGGCTTACGTATA6GGG6AAATCAAG 
TTGTGTGTGTTCCCTCCAAACACC 




CelegE98 
CelegElOO 


GGACCATTCTTAAGGACAGGeGAT 
ACATAGTGATCTTTCCATCAGCAAAG 


ExonlO 


CelegE117 
CelegE120 


TGAATGCACAGAGACCCTCCTG 
CCTCTTACCATTCAGATACT<5TTAGG 


Exonll 


CelegE121 
CelegE124 


AGCAACAACTCAAACCAGCCCTAC 
TTCTTCAGTTGCCAACTCCCAGG 




CelegE125 
CelegE128 


AAGCTGCTTGTGTGGCAGCAG 
AGTAAGGTGAACAGGAAAGTACAGAG 


Exonl2 


CelegE130 
Cel eaE132 


TACATAAGAGAGGCTGCCGCATAG 
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NHSOYNMVL AYDLASREWLPLNRSVNNVVVRYGHSLAL YKDK I YMYGGKi DSTGNVTNEL 
290 300 310 320 330 340 
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ke \cbA 

430 440 450 460 470 480 

inputs RVFHl HNESWVL L TPKAKEQYAWQMSAH I VTLKNGRVVML V I'FGHCPL YGY 1 SNVOE YD 



RVFH I HNE SWVL L TPKAKEQYAVVG.HSAH 1 VTLKNGRVVML V 1 FGHCPL YGY I SNVOE YO 
350 360 370 380 390 400 

KeldiS 

490 N 500 510 520 530 "540 

inputs LDKNTWS I L HTOG AL VOGG YGjH^SVYDHRT'RAL YVHGGYKAPSANKYRL AOOL YRYDVDT 

LDKNTW5 1 LHTOGALVOGGYGHSSVYOHRTRALYVHGGYKAFSANKYRLADDLYRYOVDT 
410 420 430 4 40 450 460 
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550 560 570 580 590 600 

i nputs QMWT 1 LKDSRFFRYljHpAVI VSGTMLVFGGNTHNDTSMSHGAKCFSSOFMAYD I ACORWS 

(3MWTILKDSRFFRYLHTAVIVSGTMLVFOGNTHN0TSMSHGAKCFSSDFMAY0IAGDRWS 
470 480 490 500 510 520 

ke I ch 1 c t 

610 620 630 640 650 660 

inputs VL PRPDLHHD VNRFG]h$ AVLHNS TMY VFOGFNSLLL SO I L VF TCECfD AHRSE AACLAAG 



VLPRPDSTMMSf DL A I PAVL HI^TMYVPGGF NSL LLSOi-L VF TSEOCOAHRSE AACL AAG 
530 540 550 560 570 530 

pl-exin 1 

670 680 690 700 710 720 

inputs PGI RCVWNTGSSQC I SWALATDEOEEKLK^SECFSKRTLOHDRCDOHTDCYSCTANTNOCH 



PG I RCVWNTGSSQC iSWAL ATDEOEEKLKSE'Cf-SK'RTLOHORCOOHTJX YSCTANTNDCH 
590 600 610 -620 630 [640 

I igond-bimJifK) Kylokine cho 
730 740 750 760 770 780 

inputs WCNDHCVPRNHSCSEGOJ S 1 FR YE NCPKDNPMY YONKK T"SCRSC ALOONCOWE-PRNOEG I 



WCNDHCVPRNHSCSEGO IS I F R YE NCPKONPMY YCNKK T SCRSCALOONCOWE'PRNOEC 1 
650 660 670 mo €90 700 

c-type l-e-c t i n 

790 800 810 -820 830 €40 

i nputs AL]PEN1[CG IGWHLVGNSCLK I T TAKE NYON AKLf CRNHNAL-L ASL TTQKK VE F VL KOL R I 



ALPENIC-G I€WHL VGNSCLK1 T T AKE N YONAKL FGRNHN AL. L ASL T TOKK VCF VLKQLR I 
710 [ 720 730 740 750 760 
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850 860 870 880 890 900 

uts M0SS0SMSK L T L T PWVGL RK I NVSYWCWEDMSPFTNSLLQWMPSEPSDAGFCG 1 LSEPST 



MOSSOSMSKL T L TPWVGLRK I NVSYWCWEDMSPF TNSL LQWMPSEPSDAGFCG I LSEPST 
770 780 790 800 810 820 

pi ex in repeat 2 
910 920 930 940 950 960 

inputs RGLKAATCINPLNGSVCjERPANHS^KOCRTPCALRTACGDCTSGSSECMWCSNMKQCVDS 



RGLKAATC I NPLNGSVCEjRPANHSAKOCRTPCALRTACGOCTSGSSECMWCSNMKOCVDS 
830 840 850 860 870 880 

970 980 990 1000 1010 1020 

puts NAYVASFPFGQCMEWYTMSTCPPENCSGYCTCSHCLEQPGCGWCTDPSNTGKGKCIEGSY 



NAYVASFPFGQCMEWYTMSTCPPENCSGYCTCSHCLEQPGCGWCTDPSNTGKGKCIEGSY 

890 900 910 920 930 940 

lomin-l ike EGF-1 

1030 1040 1050 1060 1070 1080 

i nputs KGPVKMPSOAPTGNFYPOPLLNSSMCLEDSRYNWSF IHC^AfcQCNGHSKCINQSICEKCE 



KGPVKMPSOAPTGNFYPOPLLNSSMCLEDSRYNWSF I HCPACQCNGHSKCINQS1CEKCE 
950 960 970 980 990 1000 
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1090 1100 1110 1120 . 1130 1140 

inputs NLTTGKHCE TC I SGFYGDPTNGGKCCjP^KCNGHASLCNTNTGKCFGTTKGVKGDECQLCE 
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1210 1220 1230 1240 1250 1260 

inputs NFNLN I TWAASFSAGTQAGEEMPVVSKTN I KE YKDSF SNEKFDFRNHPN 1 TF F VY VSNF T 



NFNLN1 TWAASFSAGTQAGEEMPVVSKTN I KE YKDSF SNEKFDFRNHPN I Tf F VYVSNFT 
1130 1140 1150 1160 1170 1180 
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